A paraphrase or rephrase (/ˈpærəˌfreɪz/) is the rendering of the same text in different words without losing the meaning of the text itself.[1] More often than not, a paraphrased text can convey its meaning better than the original words. In other words, it is a copy of the text in meaning, but which is different from the original. For example, when someone tells a story they heard in own words, they paraphrase, with the meaning being the same.[1] The term itself is derived via Latinparaphrasis, from Ancient Greekπαράφρασις (paráphrasis) 'additional manner of expression'. The act of paraphrasing is also called paraphrasis.
History
Although paraphrases likely abounded in oral traditions, paraphrasing as a specific educational exercise dates back to at least Roman times, when the author Quintilian recommended it for students to develop dexterity in language. In the Middle Ages, this tradition continued, with authors such as Geoffrey of Vinsauf developing schoolroom exercises that included both rhetorical manipulations and paraphrasing as a way of generating poems and speeches. There is interest in the study of paraphrases relating to concerns around plagiarism and original authorship.[2]
Types
For the purposes of education, Fred Inglis identifies five levels of paraphrase:[3][4]
replacing words with synonyms
varying sentence structure
reordering information
turning long sentences into multiple shorter ones (or vice versa)
expressing abstract concepts more concretely.
Paraphrasing with synonyms is considered by some to be an acceptable stage in teaching paraphrase, but it is necessary that it is ultimately combined with techniques for altering sentence structure to avoid the appearance of plagiarism.[5] Studies of English language students have found that ESL learners tend to rely on using synonyms rather than changing sentence structure when paraphrasing. Participants in a study of some Vietnamese ESL learners expressed that they preferred using synonyms out of a fear that using the wrong sentence structure would lead to the sentence having a different meaning. Na and Mai suggest that ESL teachers should provide varied activities including tasks requiring changes in syntax, and that ESL students should be given source texts to paraphrase whose meaning they are already readily able to understand.[6]
Natural Language Processing researchers have defined various (atomic) paraphrase types to better understand how paraphrasing occurs in humans. These types fall into six broad categories, each reflecting different ways in which a text can be altered to convey the same or similar meaning:[7][8][9][10]
morphology-based changes
lexicon-based changes
lexico-syntactic based changes
syntax-based changes
discourse-based changes
extremes.
Morphology-based changes involve alterations at the level of word formation, such as changing the tense of verbs or the number of nouns. For instance, converting "walks" to "walked" represents a morphological change by altering the verb's tense. Lexicon-based changes include changes made by substituting words with their synonyms or closely related words without significantly altering the sentence structure. An example could be changing "quick" to "fast" in a sentence, where both adjectives convey a similar speed attribute. Lexico-syntactic-based changes contain both lexical alterations and modifications in the sentence structure. An example might be transforming an active voice sentence like "The cat chased the mouse" into a passive voice "The mouse was chased by the cat," where both the sentence structure and some words are altered. Syntax-based changes are primarily focused on the structure of the sentence rather than the words themselves. For example, changing a complex sentence into two simpler sentences while maintaining the overall meaning falls into this category. Discourse-based changes are alterations that affect the larger discourse or text structure, such as reordering points in a paragraph or changing the way arguments are presented without altering the factual content. Extremes include changes that significantly alter the text, possibly introducing new information or omitting crucial details, thus pushing the boundaries of what might typically be considered a paraphrase.
Machine learning models have been trained to generate paraphrases with specific properties, such as high semantic similarity and syntactic diversity, or to generate specific paraphrase types.[11][12] Models that have high capacity in paraphrasing can be used for various applications. For example, the granular understanding of the linguistic changes involved in paraphrase generation could be directly applied to support language learners. A model can provide simpler paraphrases considering specific linguistic variations (e.g., syntax) to support students in learning new words and concepts. Universities could create a linguistic profile of their students based on their assignments and better assess their thesis with content similarity detection for potential plagiarism cases.[13][14][15] Different types of paraphrases such as syntax and lexicon changes have also been used for prompt engineering to adjust prompts in specific linguistic aspects to achieve better model outputs.[16][17][18]
Analysis
A paraphrase typically explains or clarifies the text that is being paraphrased.[19] For example, "The signal was red" might be paraphrased as "The train was not allowed to pass as the red signal light was illuminated". A paraphrase can be introduced with verbum dicendi—a declaratory expression to signal the transition to the paraphrase. For example, in "The author states 'The signal was red,' that is, the train was not allowed to proceed," the that is signals the paraphrase that follows.
A paraphrase does not need to accompany a direct quotation.[20] The paraphrase typically serves to put the source's statement into perspective or to clarify the context in which it appeared.[21] A paraphrase is typically more detailed than a summary.[22] One should add the source at the end of the sentence: When the light was red, trains could not go (Wikipedia). A paraphrase may attempt to preserve the essential meaning of the material being paraphrased.[23][24] Thus, the (intentional or otherwise) reinterpretation of a source to infer a meaning that is not explicitly evident in the source itself qualifies as "original research," and not a paraphrase. Unlike a metaphrase, which represents a "formal equivalent" of the source, a paraphrase represents a "dynamic equivalent" thereof. While a metaphrase attempts to translate a text literally, a paraphrase conveys the essential thought expressed in a source text—if necessary, at the expense of literality. For details, see dynamic and formal equivalence.
In your own words
The phrase "in your own words" is often used within this context to imply that the writer has rewritten the text in their own writing style – how they would have written it if they had created the idea.[25] Nowadays, there are some models to learn and recognize paraphrase on natural language texts.[26] Sentences can also be automatically paraphrased using text simplification software.[27]
^Kovatchev, Venelin; Martí, M. Antònia; Salamó, Maria (2018). Calzolari, Nicoletta; Choukri, Khalid; Cieri, Christopher; Declerck, Thierry; Goggi, Sara; Hasida, Koiti; Isahara, Hitoshi; Maegaard, Bente; Mariani, Joseph (eds.). "ETPC - A Paraphrase Identification Corpus Annotated with Extended Paraphrase Typology and Negation". Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan: European Language Resources Association (ELRA).
^Bandel, Elron; Aharonov, Ranit; Shmueli-Scheuer, Michal; Shnayderman, Ilya; Slonim, Noam; Ein-Dor, Liat (2022). Muresan, Smaranda; Nakov, Preslav; Villavicencio, Aline (eds.). "Quality Controlled Paraphrase Generation". Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Dublin, Ireland: Association for Computational Linguistics: 596–609. arXiv:2203.10940. doi:10.18653/v1/2022.acl-long.45.
^Wahle, Jan Philip; Gipp, Bela; Ruas, Terry (2023). "Paraphrase Types for Generation and Detection". In Bouamor, Houda; Pino, Juan; Bali, Kalika (eds.). Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Singapore: Association for Computational Linguistics. pp. 12148–12164. doi:10.18653/v1/2023.emnlp-main.746.
^Wahle, Jan Philip; Ruas, Terry; Xu, Yang; Gipp, Bela (2024). Al-Onaizan, Yaser; Bansal, Mohit; Chen, Yun-Nung (eds.). "Paraphrase Types Elicit Prompt Engineering Capabilities". Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami, Florida, USA: Association for Computational Linguistics: 11004–11033.