![]() Includes/porter_stemmer.php – a lousy stemmer script used by the summarizer class. Includes/summarizer.php – the Summarizer PHP class. Test_summarizer.php – a simple demo of the text summarizer script. ![]() For example, if the word “Linux” occurs 4 times overall, and the word “Windows” occurs 3 times, then the sentence “Windows bad, Linux – Linux good!” will get a rating of 11 (assuming “bad” and “good” didn’t make it into the Top 20 word list). In this case I simply added together the popularity ratings of every “important” word in the sentence. Rate each sentence by the words it contains.Also, the “top 20” threshold is a mostly arbitrary choice, so feel free to experiment with other numbers. The idea is that the most common words reflect the main topics of the input text. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |