Learn Before
Corpora for Simplification
-
Main-Simple English Wikipedia: The Simple English Wikipedia (SEW) is a version of the online English Wikipedia (EW) primarily aimed at English learners, but which can also be beneficial for students, children, and adults with learning difficulties. With this purpose, articles in SEW use fewer words and simpler grammatical structures.
-
Newsela Corpus: It contains 1,130 news articles with up to five simplified versions each: The original text is version 0 and the most simplified version is 5. The target audience considered was children with different education grade levels. These simplifications were produced manually by professional editors, which is an improvement over SEW where volunteers performed the task.
0
1
Tags
Data Science
Related
Corpora for Simplification
Data-Driven Approaches to Sentence Simplification
Examples of Prompt Templates for Text Simplification
Simplifying Prompt Text for Efficiency
Sequence-to-Sequence Models for Text Simplification
A system is designed to modify text to make it easier to read while preserving the original meaning. Given the original sentence below, which of the following outputs represents the most successful modification according to these goals?
Original: "The legislative body's recent enactment of the statute, which was predicated on extensive empirical analysis, is anticipated to have a profound and multifaceted impact on the nation's socioeconomic fabric."
Evaluating Text Simplification Models
Data Requirements for a Targeted Text Simplification System