-3

website = https://www.sec.gov/Archives/edgar/data/3662/0000950170-98-000413.txt

I want to extract the section titled "MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS" as plain text from the link above, using a for loop in Python.

James Z
  • What code have you written to do this and where exactly are you stuck? – ForceBru Aug 06 '19 at 11:40
  • See also [Under what circumstances may I add "urgent" or other similar phrases to my question, in order to obtain faster answers?](//meta.stackoverflow.com/q/326569) (tl;dr: never) – grooveplex Aug 06 '19 at 11:42
  • The problem of extracting specific Items from EDGAR filings has been discussed very frequently. Search for the Q&As - but you should know: it's very difficult. – Jack Fleeting Aug 06 '19 at 12:00

1 Answer

0

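You can use Python's urllib.request module (urllib2 in Python 2) to do this.

Example Python 3 code:

import urllib.request

url = "https://wordpress.org/plugins/about/readme.txt"  # example URL; substitute your own
data = urllib.request.urlopen(url).read().decode("utf-8")  # download and decode the file
data = data.split("\n")  # then split it into lines

heading = "MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS"
for line in data:
    if line.strip() == heading:  # compare with ==, ignoring surrounding whitespace
        print(line)  # or whatever you want to do with it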
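
Note that the loop above only finds the heading line itself. To get the text under the heading, one option is to keep collecting lines after the match until the next section starts. Below is a minimal sketch of that idea, not a definitive solution: the function name `extract_section`, the `stop_prefix` parameter, and the sample text are made up for illustration, and it assumes the next section begins with a line starting with "ITEM", which may not hold for every filing.

```python
def extract_section(text, heading, stop_prefix="ITEM"):
    """Return the lines after `heading`, up to the next line starting with `stop_prefix`.

    Assumption: the following section starts with a line that begins with
    `stop_prefix` (case-insensitive). Body lines that happen to start with
    that prefix would end the section early.
    """
    section = []
    inside = False
    for line in text.split("\n"):
        stripped = line.strip()
        if not inside:
            # Still searching for the heading line.
            if stripped.upper() == heading.upper():
                inside = True
            continue
        if stripped.upper().startswith(stop_prefix):
            break  # reached the start of the next section
        section.append(line)
    return "\n".join(section).strip()


# Hypothetical sample standing in for the downloaded filing text.
sample = """ITEM 6. SELECTED FINANCIAL DATA
Some table here.
ITEM 7.
MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS

Revenue increased during the year.
Costs were flat.
ITEM 8. FINANCIAL STATEMENTS
Balance sheet follows.
"""

HEADING = "MANAGEMENT'S DISCUSSION AND ANALYSIS OF FINANCIAL CONDITION AND RESULTS OF OPERATIONS"
print(extract_section(sample, HEADING))
```

With a real filing you would pass the downloaded text instead of `sample`. The heading may also appear more than once (for example in a table of contents), so the stop condition and matching will probably need adjusting for your document.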
PirrenCode