problem in reading tag from web page in java</a></h1> </div> <div class="grid fw-wrap pb8 mb16 bb bc-black-075"> <div class="grid--cell ws-nowrap mr16 mb8" title="2016-01-12 19:07:53Z"> <span class="fc-light mr2">Asked</span> <time itemprop="dateCreated" datetime="2011-05-14T07:10:19.320" class="fromnow">May 14 '11 at 07:10</time> </div> <div class="grid--cell ws-nowrap mr16 mb8"> <span class="fc-light mr2">Active</span> <time class="fromnow" title="2011-05-14T07:16:32.263" datetime="2011-05-14T07:16:32.263">May 14 '11 at 07:16</a> </div> <div class="grid--cell ws-nowrap mb8" title="Viewed 348 times"> <span class="fc-light mr2">Viewed</span> 348 times </div> </div> <div id="mainbar" role="main" aria-label="questions and answers"> <div id="question" class="question" data-questionid="6000347" data-ownerid="685205" data-score="1"> <div class="post-layout"> <div class="votecell post-layout--left"> <div class="js-voting-container grid jc-center fd-column ai-stretch gs4 fc-black-200" data-post-id="6000347"> <button class="js-vote-up-btn grid--cell s-btn s-btn__unset c-pointer"><svg aria-hidden="true" class="m0 svg-icon iconArrowUpLg" width="36" height="36" viewBox="0 0 36 36"><path d="M2 26h32L18 10 2 26z"></path></svg></button> <div class="js-vote-count grid--cell fc-black-500 fs-title grid fd-column ai-center" itemprop="upvoteCount" data-value="1">1</div> <button class="js-bookmark-btn s-btn s-btn__unset c-pointer py4"> <svg aria-hidden="true" class="svg-icon iconBookmark" width="18" height="18" viewBox="0 0 18 18"><path d="M6 1a2 2 0 00-2 2v14l5-4 5 4V3a2 2 0 00-2-2H6zm3.9 3.83h2.9l-2.35 1.7.9 2.77L9 7.59l-2.35 1.7.9-2.76-2.35-1.7h2.9L9 2.06l.9 2.77z"></path></svg> <div class="js-bookmark-count mt4" data-value=""></div> </button> </div> </div> <div class="postcell post-layout--right"> <div class="s-prose js-post-body" itemprop="text"><p>I am using jtidy parser to parse the web page. It is working, sort of:</p> <pre><code>InputStream in=new URL("http://www.medicinenet.com/alopecia_areata/article.htm").openStream(); Document doc= new Tidy().parseDOM(in, null); String titleText=doc.getElementsByTagName("title").item(0).getFirstChild().getNodeValue(); </code></pre> <p>It is working fine for <code><title>...</title></code>, but the url which I passed, it contains title tag <code><TITLE>...</TITLE></code> in capital letter. So it is returning null.</p> <p>How to read <code><TITLE>...</TITLE></code> & <code><title>...</title></code> in one statement using java code? Please help me.</p></div> <div class="mt24 mb12"> <div class="post-taglist grid gs4 gsy fd-column"> <div class="grid ps-relative"> <a href="../../questions/tagged/java" class="post-tag js-gps-track" title="show questions tagged 'java'" rel="tag">java</a> <a href="../../questions/tagged/jtidy" class="post-tag js-gps-track" title="show questions tagged 'jtidy'" rel="tag">jtidy</a> </div> </div> </div> <div class="mb0"> <div class="mt16 grid gs8 gsy fw-wrap jc-end ai-start pt4 mb16"> <div class="grid--cell mr16 fl1 w96"></div> <div class="post-signature grid--cell"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="edited May 14 '11 at 07:14">edited May 14 '11 at 07:14</time> <a href="../../users/69224/corlettk" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/69224.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="corlettk" /> </a> <div class="s-user-card--info"> <a href="../../users/69224/corlettk" class="s-user-card--link">corlettk</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">13,288</li> <li class="s-award-bling s-award-bling__gold" title="7 gold badges">7</li> <li class="s-award-bling s-award-bling__silver" title="38 silver badges">38</li> <li class="s-award-bling s-award-bling__bronze" title="52 bronze badges">52</li> </ul> </div> </div> </div> <div class="post-signature owner grid--cell"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked May 14 '11 at 07:10">asked May 14 '11 at 07:10</time> <a href="../../users/685205/dj31" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/685205.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="DJ31" /> </a> <div class="s-user-card--info"> <a href="../../users/685205/dj31" class="s-user-card--link">DJ31</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">1,219</li> <li class="s-award-bling s-award-bling__gold" title="3 gold badges">3</li> <li class="s-award-bling s-award-bling__silver" title="14 silver badges">14</li> <li class="s-award-bling s-award-bling__bronze" title="19 bronze badges">19</li> </ul> </div> </div> </div> </div> </div> </div> <div class="post-layout--right js-post-comments-component"> </div> </div> </div> <div id="answers"> <a name="tab-top"></a> <div id="answers-header"> <div class="answers-subheader grid ai-center mb8"> <div class="grid--cell fl1"> <h2 class="mb0" data-answercount="9">1 Answers<span style="display:none;" itemprop="answerCount">1</span></h2> </div> </div> </div> <a name="6000375"></a> <div id="answer-6000375" class="answer " data-answerid="6000375" data-ownerid="449856" data-score="1" itemprop="suggestedAnswer" itemscope="" itemtype="https://schema.org/Answer"> <div class="post-layout"> <div class="votecell post-layout--left"> <div class="js-voting-container grid jc-center fd-column ai-stretch gs4 fc-black-200" data-post-id="6000375"> <button class="js-vote-up-btn grid--cell s-btn s-btn__unset c-pointer"><svg aria-hidden="true" class="m0 svg-icon iconArrowUpLg" width="36" height="36" viewBox="0 0 36 36"><path d="M2 26h32L18 10 2 26z"></path></svg></button> <div class="js-vote-count grid--cell fc-black-500 fs-title grid fd-column ai-center" itemprop="upvoteCount" data-value="1">1</div> </div> </div> <div class="postcell post-layout--right"> <div class="s-prose js-post-body" itemprop="text"><p>Just check for null, then check uppercase</p> <pre><code>String titleText=doc.getElementsByTagName("title").item(0).getFirstChild().getNodeValue(); if (titleText == null) titleText=doc.getElementsByTagName("TITLE").item(0).getFirstChild().getNodeValue(); </code></pre> <p><code>getElementsByTagName</code> is case sensitive, so this is the simplest option.</p></div> <div class="mb0"> <div class="mt16 grid gs8 gsy fw-wrap jc-end ai-start pt4 mb16"> <div class="grid--cell mr16 fl1 w96"></div> <div class="post-signature grid--cell"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="answered May 14 '11 at 07:16">answered May 14 '11 at 07:16</time> <a href="../../users/449856/codemwnci" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/449856.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="Codemwnci" /> </a> <div class="s-user-card--info"> <a href="../../users/449856/codemwnci" class="s-user-card--link">Codemwnci</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">54,176</li> <li class="s-award-bling s-award-bling__gold" title="10 gold badges">10</li> <li class="s-award-bling s-award-bling__silver" title="96 silver badges">96</li> <li class="s-award-bling s-award-bling__bronze" title="129 bronze badges">129</li> </ul> </div> </div> </div> </div> </div> </div> <div class="post-layout--right js-post-comments-component"> <div id="comments-6000375" class="comments js-comments-container bt bc-black-075 mt12 " data-post-id="6000375" data-min-length="15"> <ul class="comments-list js-comments-list" data-remaining-comments-count="0" data-canpost="false" data-cansee="true" data-comments-unavailable="false" data-addlink-disabled="true"> <li id="comment-6933173" class="comment js-comment " data-comment-id="6933173" data-comment-owner-id="685205" data-comment-score="0"> <div class="js-comment-actions comment-actions"> <div class="comment-score js-comment-edit-hide"> </div> </div> <div class="comment-text js-comment-text-and-form"> <a name="comment6933173_6000375"></a> <div class="comment-body js-comment-edit-hide"> <span class="comment-copy">I tried above code,but this is not working,please help me with another solution</span> – <a href="../../users/685205/dj31" title="1,219 reputation" class="comment-user owner">DJ31</a> <span class="comment-date" dir="ltr"><a class="comment-link" href="../../questions/6000347/problem-in-reading-title-tag-from-web-page-in-java#comment6933173_6000375"><span title="2011-05-14T12:32:00.947 License: CC BY-SA 3.0" class="relativetime-clean">May 14 '11 at 12:32</span></a></span> </div> </div> </li> <li id="comment-6949692" class="comment js-comment " data-comment-id="6949692" data-comment-owner-id="685205" data-comment-score="0"> <div class="js-comment-actions comment-actions"> <div class="comment-score js-comment-edit-hide"> </div> </div> <div class="comment-text js-comment-text-and-form"> <a name="comment6949692_6000375"></a> <div class="comment-body js-comment-edit-hide"> <span class="comment-copy">I tried this code as you said---> if (titleText == null) titleText=doc.getElementsByTagName("TITLE").item(0).getFirstChild().getNodeValue();----------it is throwing exception,again it is returning null.</span> – <a href="../../users/685205/dj31" title="1,219 reputation" class="comment-user owner">DJ31</a> <span class="comment-date" dir="ltr"><a class="comment-link" href="../../questions/6000347/problem-in-reading-title-tag-from-web-page-in-java#comment6949692_6000375"><span title="2011-05-16T05:28:50.167 License: CC BY-SA 3.0" class="relativetime-clean">May 16 '11 at 05:28</span></a></span> </div> </div> </li> <li id="comment-6950028" class="comment js-comment " data-comment-id="6950028" data-comment-owner-id="449856" data-comment-score="0"> <div class="js-comment-actions comment-actions"> <div class="comment-score js-comment-edit-hide"> </div> </div> <div class="comment-text js-comment-text-and-form"> <a name="comment6950028_6000375"></a> <div class="comment-body js-comment-edit-hide"> <span class="comment-copy">is the title tag in mixed case...e,g <title> – Codemwnci May 16 '11 at 06:13

  • no,it is not in mixed case,it is in capital letter,''. – DJ31 May 16 '11 at 06:39