Product data: Details product attributes across various categories, enabling analysis of consumer behavior across the country.

Use Cases for the 100k Dataset
Customer reviews: Includes the raw text of customer reviews, making it a prime candidate for natural language processing (NLP) and sentiment analysis.
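As a rough illustration of how review text could be prepared for sentiment analysis, the sketch below turns star ratings into coarse labels. The column names `review_comment_message` and `review_score` are assumptions about the reviews CSV and should be checked against the file you download; the 4-5 = positive, 1-2 = negative split is an illustrative choice, and the sample rows are made up.

```python
import csv
import io

# Assumed column names for the reviews file; verify against your copy.
TEXT_COL = "review_comment_message"
SCORE_COL = "review_score"


def label_reviews(rows):
    """Yield (text, label) pairs, skipping empty comments and neutral 3-star scores."""
    for row in rows:
        text = (row.get(TEXT_COL) or "").strip()
        if not text:
            continue  # many reviews carry a score but no written comment
        score = int(row[SCORE_COL])
        if score == 3:
            continue  # treated as neutral here; this threshold is an assumption
        yield text, "positive" if score >= 4 else "negative"


# Tiny made-up sample standing in for the real CSV file.
SAMPLE = """review_score,review_comment_message
5,Produto excelente
1,Não recebi o produto
3,Ok
4,
"""

pairs = list(label_reviews(csv.DictReader(io.StringIO(SAMPLE))))
for text, label in pairs:
    print(label, "|", text)
```

With the real file, the `io.StringIO(SAMPLE)` stand-in would be replaced by an open file handle using `encoding="utf-8"`.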
Model fine-tuning: Use the review text to fine-tune AI models for Portuguese-specific sentiment or linguistic nuances.

Technical Details & Formats
Language: Primarily Portuguese, which is why the dataset is often paired with translation or speech-to-text resources such as wav2vec models for Brazilian Portuguese.
While the official release is typically in CSV format, many GitHub repositories and data science platforms host .txt or .parquet conversions for specific programming needs.
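If only a plain-text version is needed, a CSV column can be converted locally rather than relying on a third-party copy. The following is a minimal sketch using only the Python standard library; the helper name `csv_column_to_txt`, the file names, and the `review_comment_message` column are hypothetical and used for illustration only.

```python
import csv
import os
import tempfile


def csv_column_to_txt(csv_path, txt_path, column):
    """Write each non-empty value of `column` on its own line; return the line count."""
    written = 0
    with open(csv_path, newline="", encoding="utf-8") as src, \
            open(txt_path, "w", encoding="utf-8") as dst:
        for row in csv.DictReader(src):
            # Collapse embedded newlines and repeated spaces so one record = one line.
            value = " ".join((row.get(column) or "").split())
            if value:
                dst.write(value + "\n")
                written += 1
    return written


# Demo on a throwaway file with made-up rows.
with tempfile.TemporaryDirectory() as tmp:
    src_path = os.path.join(tmp, "reviews.csv")
    out_path = os.path.join(tmp, "reviews.txt")
    with open(src_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["review_id", "review_comment_message"])
        writer.writerow(["a1", "Chegou rápido\ne bem embalado"])
        writer.writerow(["a2", ""])
    count = csv_column_to_txt(src_path, out_path, "review_comment_message")
    with open(out_path, encoding="utf-8") as f:
        lines = f.read().splitlines()
    print(count, lines)
```

For a Parquet copy, pandas offers `DataFrame.to_parquet`, which needs an engine such as pyarrow installed; that route is not shown here.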
Source: The official Olist dataset on Kaggle is the most reliable source.
This dataset is particularly popular with developers and researchers because it provides a multi-dimensional view of the e-commerce lifecycle:
Geolocation: Relates Brazilian zip codes to latitude and longitude coordinates, allowing for complex logistics and delivery-time mapping.
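Once zip codes are resolved to coordinates, one common building block for delivery mapping is the great-circle distance between seller and customer. The sketch below uses the haversine formula with a mean Earth radius of 6371 km; the function name and the approximate city-centre coordinates are illustrative and not taken from the dataset.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, a standard approximation


def haversine_km(lat1, lng1, lat2, lng2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lng2 - lng1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


# Approximate city centres, used only as an example pair.
sao_paulo = (-23.5505, -46.6333)
rio_de_janeiro = (-22.9068, -43.1729)

distance = haversine_km(*sao_paulo, *rio_de_janeiro)
print(f"{distance:.0f} km")
```

Straight-line distance is only a proxy for delivery effort; road distance and carrier routing can differ substantially.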