In this assignment, I applied a text summarization API(http://textsummarization.net) to an abridged version of The Republic, Book I. textsummarization.net, managed by a group of professional Natural Language Processing and Machine Learning researchers from University of Science and Technology of China(USTC), is one of the several popular text summarization APIs online. In this experiment, I want to summarize The Republic downto one sentence.
Completing this task in one setting can be very demanding of the algorithm and does not conform with how information is conventionally processed. Therefore, I decided to cut 1/2 of the text from every last attempt to produce the new summary. The original file has approximately 479 sentences; it is reduced from 479 to 240, from 240 to 120… until there is only one sentence left.
I’m still trying to learn more about the exact algorithm applied in the API. Therefore, I cannot comment on how reliable it is. However, since a decent amount information is kept in the first few rounds (and they seem to convey the material in the Republic pretty well), we may assume that the algorithm is at least, functional. Although in the final summary, the information in The Republic is completely destroyed, I must admit that the final summary is exactly my what I can tell about The Republic at the moment (after studied it a few semesters ago).