Redfin, one of the largest US real estate brokerages, publishes its aggregate market data as a free public dataset — a rare and valuable resource for researchers, developers, and investors. ZipMarketData ingests this data daily to power its API.

What Redfin's Public Dataset Contains

The dataset covers market statistics at the ZIP code and city level, including:

  • Median sale price and median list price
  • Median days on market
  • Active listings count
  • Homes sold count
  • Sale-to-list price ratio
  • Year-over-year changes for each metric

Data is published as TSV.GZ files to Redfin's public S3 bucket and updated approximately monthly, covering records back to 2012 for most markets.

Coverage and Limitations

Redfin's market data covers markets where Redfin operates as a brokerage — approximately 3,000+ cities and 28,000+ ZIP codes across the US. Markets where Redfin has minimal presence may have sparse data, particularly rural ZIP codes. ZipMarketData flags data confidence levels in API responses and provides regional estimate fallbacks for thin-data ZIP codes.

DisclosureRedfin's data represents homes listed or sold through Redfin's platform and MLS partners, not every transaction in the market. It is representative but not exhaustive — county recorder data is the ground truth for all transactions.

Why This Matters for API Reliability

Because ZipMarketData uses an authoritative public-domain source (Redfin publishes with explicit redistribution permission), the data pipeline runs daily without API keys, scraping fragility, or proprietary licensing concerns — keeping your data fresh and legally clean.