Redfin, one of the largest US real estate brokerages, publishes its aggregate market data as a free public dataset — a rare and valuable resource for researchers, developers, and investors. ZipMarketData ingests this data daily to power its API.
What Redfin's Public Dataset Contains
The dataset covers market statistics at the ZIP code and city level, including:
- Median sale price and median list price
- Median days on market
- Active listings count
- Homes sold count
- Sale-to-list price ratio
- Year-over-year changes for each metric
Data is published as TSV.GZ files to Redfin's public S3 bucket and updated approximately monthly, covering records back to 2012 for most markets.
Coverage and Limitations
Redfin's market data covers markets where Redfin operates as a brokerage — approximately 3,000+ cities and 28,000+ ZIP codes across the US. Markets where Redfin has minimal presence may have sparse data, particularly rural ZIP codes. ZipMarketData flags data confidence levels in API responses and provides regional estimate fallbacks for thin-data ZIP codes.
Why This Matters for API Reliability
Because ZipMarketData uses an authoritative public-domain source (Redfin publishes with explicit redistribution permission), the data pipeline runs daily without API keys, scraping fragility, or proprietary licensing concerns — keeping your data fresh and legally clean.