Kids101 | Txt

A dataset containing over 60,000 poems written by children in grades 1 through 12, often used for age classification and sentiment analysis.

Do children texts hold the key to commonsense knowledge? - HAL

LLMs often struggle to limit their vocabulary to age-appropriate levels. This research develops a dataset and pipeline for fine-tuning models specifically to simplify and generate stories for younger age groups.

4. Notable Children's Text & Speech Datasets

Depending on your focus, here are the most relevant academic papers and datasets involving children's text and AI:

"Do children texts hold the key to commonsense knowledge?"

This paper discusses the unique linguistic needs of children and provides early insights into developing specialized language models (KidLM) that are safer and more pedagogically appropriate for young users.

3. Automatic Story Generation & Simplification