I want to extract a certain paragraph from a plain-text web page using a for loop

Question

website = https://www.sec.gov/Archives/edgar/data/3662/0000950170-98-000413.txt

I want to extract the section titled "MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS" as plain text from the link above, in Python, using a for loop.

What code have you written to do this and where exactly are you stuck? — ForceBru, Aug 06 '19 at 11:40
See also [Under what circumstances may I add "urgent" or other similar phrases to my question, in order to obtain faster answers?](//meta.stackoverflow.com/q/326569) (tl;dr: never) — grooveplex, Aug 06 '19 at 11:42
The problem of extracting specific Items from EDGAR filings has been discussed very frequently. Search for the Q&As - but you should know: it's very difficult. — Jack Fleeting, Aug 06 '19 at 12:00

score 0 · Answer 1 · answered Aug 06 '19 at 11:57

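You can use Python's urllib.request module to do this (in Python 2 it was called urllib2).

Example of Python code:

import urllib.request

# Example URL; replace it with the file you want to read
url = "https://wordpress.org/plugins/about/readme.txt"

# Download the file and decode the bytes to text
with urllib.request.urlopen(url) as response:
    data = response.read().decode("utf-8", errors="replace")

lines = data.split("\n")  # then split it into lines

heading = "MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS"
for line in lines:
    # compare with ==, ignoring surrounding whitespace
    if line.strip() == heading:
        print(line)  # or whatever you want to do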
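Matching the heading line only finds the title, not the text under it. A minimal sketch of one way to collect the whole section with a for loop is below. It assumes you know the heading of the section that follows; `extract_section`, `start_heading`, `end_heading` and the sample text are hypothetical names and data made up for illustration, not taken from the filing.

```python
def extract_section(text, start_heading, end_heading):
    """Collect lines from start_heading up to (not including) end_heading."""
    collected = []
    inside = False
    for line in text.split("\n"):
        stripped = line.strip()
        if not inside and stripped == start_heading:
            inside = True   # start collecting at the heading line
        elif inside and stripped == end_heading:
            break           # stop at the heading of the next section
        if inside:
            collected.append(line)
    return "\n".join(collected)


# Made-up sample text standing in for the downloaded file
sample = "\n".join([
    "BUSINESS",
    "We make things.",
    "MANAGEMENT'S DISCUSSION",
    "Revenue increased.",
    "Costs decreased.",
    "FINANCIAL STATEMENTS",
    "Balance sheet follows.",
])

section = extract_section(sample, "MANAGEMENT'S DISCUSSION", "FINANCIAL STATEMENTS")
print(section)
```

On a real filing the same heading may also appear in the table of contents or be wrapped across two lines, so the first exact match may not be the one you want; treat this as a starting point only.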

PirrenCode


I want the whole section, i.e. from page 14 to page 19 in this link, but in a generalised form – Vijay Kumar Aug 06 '19 at 15:24
How are pages delimited in your .txt file? There must be a delimiter – PirrenCode Aug 07 '19 at 07:03
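If the file does mark page breaks, selecting a range of pages could look roughly like the sketch below. The `<PAGE>` marker is an assumption about how the file separates pages (check the actual text first), and `pages_between` and the sample string are hypothetical, for illustration only.

```python
def pages_between(text, first_page, last_page, delimiter="<PAGE>"):
    """Return the chunks numbered first_page..last_page (1-based, inclusive)."""
    pages = text.split(delimiter)  # assumed page marker; adjust to your file
    selected = []
    for number, page in enumerate(pages, start=1):
        if first_page <= number <= last_page:
            selected.append(page.strip())
    return selected


# Made-up sample: four chunks separated by the assumed marker
sample_pages = "cover<PAGE>two<PAGE>three<PAGE>four"
middle = pages_between(sample_pages, 2, 3)
print(middle)
```

Note that this counts chunks between markers, with any text before the first marker counted as chunk 1, so the numbers need not match the page numbers printed in the document.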
