Generate Statistical Q/A (Including Visualizations) from Tables with Provenance
github.com · 3d

StatQA

StatQA is a modern Python framework for automatically extracting structured facts, statistical insights, and multimodal Q/A pairs from tabular datasets. It converts raw columns and values into clear, human-readable statements paired with rich visualizations, enabling rapid knowledge discovery, CLIP-style multimodal RAG corpus construction, and LLM training.
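To make the idea of turning raw values into a statement with provenance concrete, here is a minimal sketch using only the Python standard library. It is not StatQA's API: the function name `mean_fact`, the dictionary layout, and the `provenance` fields are all assumptions chosen for illustration.

```python
import statistics

def mean_fact(rows, column, source="table"):
    """Build one Q/A pair about the mean of a numeric column (hypothetical helper)."""
    # Keep only rows where the column holds a number; remember their indices
    # so the answer can be traced back to the rows it was computed from.
    used = [(i, row[column]) for i, row in enumerate(rows)
            if isinstance(row.get(column), (int, float))]
    values = [v for _, v in used]
    mean = statistics.fmean(values)  # raises StatisticsError if no values remain
    return {
        "question": f"What is the mean of {column}?",
        "answer": f"The mean of {column} is {mean:.2f} (n={len(values)}).",
        # Assumed provenance layout: where the numbers came from and how they were combined.
        "provenance": {
            "source": source,
            "column": column,
            "rows": [i for i, _ in used],
            "method": "arithmetic mean",
        },
    }

rows = [{"age": 30}, {"age": 40}, {"age": None}, {"age": 50}]
qa = mean_fact(rows, "age")
print(qa["answer"])
```

Recording the contributing row indices alongside the answer is one simple way to let a reader check a generated statement against the source table.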

🎯 Key Features

  • 📋 Flexible Metadata Parsing: Parse codebooks from text, CSV, or PDF formats

  • 🤖 LLM-Powered Enrichment: Automatically infer variable types and relationships

  • 📊 Comprehensive Statistical Analysis:

      • Univariate: descriptive statistics, distribution tests, robust estimators

      • Bivariate: correlations, chi-square, group comparisons with effect sizes

      • Temporal: trend detection (Mann-K…
