One simple way to extract the text in a webpage is to remove all HTML tags enclosed by <> pairs. However, the extracted text will be a long character string. This project is to extract texts in a webpage and save them in a text file with predefined formats Your program should use the following table to convert a HTML file into a pure text file HTML Tags Page titleKtitle My webpage title Headings
OR
OR