ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction Paper • 2602.15189 • Published Feb 16 • 4