GradeThread
Denim jacket laid flat beside a tablet showing an AI clothing inspection grade report on a white surface

AI Clothing Inspection: How Computer Vision Grades a Garment From Photos

By GradeThread Team · ·8 min read
condition-gradingtrust-psychologydefect-taxonomygrading-photographyai-tools

AI Clothing Inspection: How Computer Vision Grades a Garment From Photos

GradeThread's AI clothing inspection reads a set of photos and returns a 1.0–10.0 grade, a factor-by-factor condition report, and a verifiable certificate — typically in under two minutes. Here is exactly how that works, where the model is confident, and where a human steps in.

Why "I'd say it's Excellent" Is a Business Problem

Every reseller has a mental condition scale. The problem is it shifts. The Levi's 501 you graded Excellent on a Tuesday after a good sourcing run gets graded Very Good on a Thursday after three returns. Buyers can't see inside your head, and neither can the next person who lists from your consignment pile.

Standardized AI grading fixes that by applying the same criteria, weighted the same way, every time. No fatigue, no optimism bias, no "it looks fine in this light." The output is a numerical score tied to named tiers — NWT, NWOT, Excellent, Very Good, Good, Fair, Poor — with the evidence logged.

That matters for returns. A buyer who disputes a "not as described" claim on eBay has to argue against a timestamped report, not against your memory of what the jacket looked like three weeks ago.

The Five Factors the Model Evaluates

GradeThread grades every garment across five weighted factors. The model doesn't produce a single holistic score from a vibe read — it scores each factor independently, then combines them into the final 1.0–10.0 grade. The factors are:

  1. Fabric Condition — Surface integrity of the textile itself. The model looks for pilling density (number and clustering of bobbles per square centimeter of visible fabric), fading gradients across panels, fiber distortion, and thinning in high-friction zones like inner thighs on denim or underarms on knits. A garment with dense Stage 3–4 pilling across the chest drops here regardless of how clean it is.
  2. Structural Integrity — The garment's ability to hold its shape and hold together. Computer vision checks seam alignment, seam separation (visible gap between joined panels), seam stress marks (thread tension lines that precede a pop), collar roll consistency, and shoulder structure on tailored pieces. A blazer with a collapsed shoulder pad scores lower here even if the fabric is pristine.
  3. Cosmetic Appearance — Visible surface defects that don't affect wearability but affect perceived value. Stains, discoloration, crocking, hardware tarnish, pen marks, and print cracking on graphic tees all land here. The model classifies stain size relative to the garment panel and location — a 2cm stain on a back hem scores differently than the same stain on a front chest panel.
  4. Functional Elements — Hardware and closures. Zippers, buttons, snaps, drawstrings, and elastic. The model checks zipper pull presence, pull alignment, visible teeth damage, button completeness against visible buttonholes, and elastic deformation in waistbands and cuffs. A missing button on a hidden placket scores differently than a missing button on a visible front placket.
  5. Odor & Cleanliness — This is the one factor computer vision cannot directly assess. More on how GradeThread handles it below.

What the Model Is Actually Seeing: Computer Vision in Plain English

The underlying approach is a convolutional neural network trained on hundreds of thousands of annotated garment images. "Annotated" means a human grader looked at each image and labeled what they saw: pilling location and density, seam stress indicators, stain type and placement, hardware condition. The model learned to recognize those patterns by seeing enough labeled examples that it can generalize to new photos.

In practice, when you upload photos, the model runs several detection passes:

Photo quality matters here. A single flat-lay shot in mixed indoor light gives the model much less to work with than four shots — front, back, detail, label — taken under diffuse daylight or a softbox. That is not a limitation unique to AI; a human grader working from one blurry photo would also hedge.

Why Low Confidence Scores Go to Human Review

When any factor score carries a confidence value below 0.75, GradeThread routes that garment to human review before the final grade is issued. This is not a fallback for when the AI "fails" — it is the designed behavior for genuinely ambiguous cases.

The most common triggers:

The human reviewer sees the same annotated images the model flagged, the model's proposed grade, and the specific factor that triggered review. They confirm, adjust, or override. The final certificate records whether the grade was AI-only or AI-plus-human-review — buyers can see that distinction.

This matters for trust. A grade issued at 0.91 confidence across all five factors means something different than one issued at 0.62 confidence on Structural Integrity. Surfacing that difference is more honest than hiding it behind a clean number.

The Odor Problem: What AI Can't See and How It's Handled

Odor & Cleanliness is the one factor that cannot be graded from a photo. A camera has no olfactory sensor. A jacket that looks immaculate in photos can carry cigarette smoke, mildew, or heavy fragrance — all of which are grounds for a return and all of which fall under eBay's "not as described" policy.

GradeThread handles this with a structured self-report protocol. When you submit a garment for grading, you answer a short odor assessment checklist:

That self-report feeds directly into the Odor & Cleanliness factor score. A "strong smoke" response drops the garment to a Fair or Poor tier on that factor regardless of how the other four score. A "faint fragrance" on an otherwise Excellent garment might hold a 7.5 overall but will be disclosed in the condition report so the buyer knows what to expect.

The honest answer here is that self-report introduces some subjectivity back into a system designed to remove it. Two resellers might rate the same mildew smell differently. We know that. It is still better than no odor disclosure at all, and it creates a paper trail: if a buyer returns a garment claiming smoke smell and the seller reported "no odor," that discrepancy is documented.

Grade Tiers: What the Final Number Means

The five factor scores combine — with different weights by garment category — into a single 1.0–10.0 grade that maps to a named tier. The tier vocabulary is fixed:

Grade Range Tier What It Means
10.0 NWT New with original tags attached, no signs of wear or washing
9.0 NWOT New condition, tags removed, no wear or washing evidence
8.0–8.5 Excellent Worn minimally; no visible defects under normal inspection
7.0–7.5 Very Good Light wear; minor cosmetic signs only, fully functional
6.0–6.5 Good Moderate wear; defects present but disclosed and proportionate to price
5.0–5.5 Fair Heavy wear or significant defects; functional but clearly used
3.0–4.5 Poor Major defects, damage, or odor; parts or upcycle value only

Category weights shift the math. On a vintage denim jacket, Fabric Condition and Cosmetic Appearance carry more weight because fading and wear patterns are both the primary value drivers and the primary return triggers. On a tailored wool blazer, Structural Integrity carries more weight because a collapsed shoulder or a popped lining seam is a functional failure, not just a cosmetic one.

What AI Grading Actually Gets You as a Reseller

The practical upside is not that the AI is infallible. It is that the grade is documented, consistent, and shareable.

When you list on eBay and attach a GradeThread certificate link, a buyer can verify the grade before they purchase. That shifts the burden of proof in a dispute. eBay's "not as described" process asks whether the item matches the listing. A listing that references a timestamped, factor-level condition report is a much stronger position than a listing that says "Excellent condition — see photos."

On Poshmark, where condition disputes often happen in the comments or offer stage, having a shareable certificate URL answers "what condition is this really?" without a negotiation. On Whatnot, where you're grading live, being able to say "this is a 7.5 Very Good, here's the report" moves faster than reading out a paragraph of description.

It also protects you from your own optimism. The jacket you sourced for $18 and love is not automatically Excellent. The model doesn't know what you paid for it.

Try It on One Garment

The fastest way to understand what AI clothing inspection actually catches is to run a garment you've already graded yourself and compare the output. Pick something you called Excellent or Very Good. Upload four photos — front, back, close detail on any wear area, and the label. See where the model lands and which factor it flags.

If it agrees with you, you have documentation. If it doesn't, you have information worth having before a buyer does.

Grade your first garment at GradeThread — no commitment, one report, see exactly what the model sees.

Grade a garment with GradeThread →
Save to Pinterest