- prompts user for filetype if none entered - modularized a couple of functions
- fixed out_file naming - pep8 and pylint reformatting
- removed read_part_files in favor of get_part_files, since pdfkit reads filenames
- fixed bug preventing writing scraped urls to pdf
- can now read in text and filter it - recognizes local files, no need for user to enter special flag - moved html/ files to testing/ and added a text file to it - added better distinction between input and output files - changed instances of file to f_name in utils - pep8 reformatting
- add scheme to urls if none present - fixed bug where raw_html was calling get_html rather than get_raw_html
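
The "add scheme to urls if none present" change could look roughly like the sketch below. This is not the project's actual code: the function name `add_scheme` and the default scheme of `http` are assumptions made for illustration.

```python
import re

# Matches a leading URL scheme such as "http://" or "ftp://".
_SCHEME_RE = re.compile(r"^[a-zA-Z][a-zA-Z0-9+.-]*://")


def add_scheme(url, default_scheme="http"):
    """Return url unchanged if it already starts with a scheme,
    otherwise prepend default_scheme (hypothetical helper)."""
    if _SCHEME_RE.match(url):
        return url
    return "{}://{}".format(default_scheme, url)
```

Checking only the start of the string avoids misreading a scheme that appears later in the URL, for example inside a query parameter.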