Questions tagged [vtd-xml]

Virtual Token Descriptor for eXtensible Markup Language (VTD-XML) refers to a collection of cross-platform XML processing technologies centered on a non-extractive XML, "document-centric" parsing technique called Virtual Token Descriptor (VTD).

Virtual Token Descriptor for eXtensible Markup Language (VTD-XML) refers to a collection of cross-platform XML processing technologies centered on a non-extractive XML, "document-centric" parsing technique called Virtual Token Descriptor (VTD). Depending on the perspective, VTD-XML can be viewed as one of the following:

  • A "Document-Centric" XML parser
  • A native XML indexer or a file format that uses binary data to enhance the text XML
  • An incremental XML content modifier
  • An XML slicer/splitter/assembler
  • An XML editor/eraser
  • A way to port XML processing on chip
  • A non-blocking, stateless XPath evaluator

VTD-XML is developed by XimpleWare and dual-licensed under GPL and proprietary license. It is originally written in Java, but is now available in C, C++ and C#.

Download vtd-xml at https://vtd-xml.sourceforge.io/

Here are a few articles describing how to code applications in VTD-XML

190 questions
2
votes
1 answer

VTD-XML: How to get XPath of current element?

I'm using VTD to parse and split very large XML files and require the actual XPath of the current element pointed to by the cursor after a call to AutoPilot.evalXPath(). Consider the following XML:
lea
  • 21
  • 2
2
votes
1 answer

JVM run time memory / net beans run time memory /JVM run time memory with arguments (VTD-Parser)

I am parsing a file which is around 600MB in size. Below is memory usage when the file runs 1] When running from Netbeans with -Xms1g runtime JVM arguments it takes around 1.03GB memory (checked from Task Manager) 2] When I directly ran JAR it…
Bhavik Patel
  • 1,044
  • 1
  • 15
  • 33
2
votes
1 answer

XPath Evaluation fails while using with AutoPilot in VTD-XML

I have a bunch of xml's which contains elements's with namespaces and i am using VTD-XML to parse that xml. But when i tries to use an xpath to get a namespaced element. It will not fetch at all? Here is the xml sample
Sumanth Shastry
  • 1,139
  • 1
  • 19
  • 28
2
votes
1 answer

What does reusing XPath mean across XMLs?

I would like to under stand what does reusing XPath mean across XMLs? Does this mean that if I am parsing 100 XMLs, I will only have one AutoPilot and just call AutoPilot.bind in every XML?
sincang
  • 119
  • 5
2
votes
1 answer

VTD-XML seems to be spoiling escaped string in XML document

I am working on an XML data set (the DrugBank database available here) where some fields contain escaped XML characters like "&", etc. To make the problem more concrete, here is an example scenario:
Chthonic Project
  • 8,216
  • 1
  • 43
  • 92
2
votes
1 answer

How to spit the large XML(more than 3GB) using VTD-XML extended

I have to split an xml which is of minimum size of 3GB. We can provide only 1.5GB heap space in 64 bit JVM on Windows OS. I have got example codes all over the Internet using VTDNav only, not with VTDNavHuge. The agenda is to read the above…
2
votes
1 answer

VTD-xml ignore well formed file

I will parse xml file (this peace of file): Ardent…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/xml-parsing" class="post-tag grid--cell" title="show questions tagged 'xml-parsing'" rel="tag">xml-parsing</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> <a href="../../questions/tagged/well-formed" class="post-tag grid--cell" title="show questions tagged 'well-formed'" rel="tag">well-formed</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Apr 10 '14 at 10:50">asked Apr 10 '14 at 10:50</time> <a href="../../users/1103260/pheonix" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/1103260.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="Pheonix" /> </a> <div class="s-user-card--info"> <a href="../../users/1103260/pheonix" class="s-user-card--link">Pheonix</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">161</li> <li class="s-award-bling s-award-bling__silver" title="3 silver badges">3</li> <li class="s-award-bling s-award-bling__bronze" title="14 bronze badges">14</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-22717078"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status answered-accepted"> <strong>1</strong> answer </div> </div> </div> <div class="summary"> <h3><a href="../../questions/22717078/xpath-on-log4j-xml-using-vtd-xml" class="question-hyperlink">XPath on log4j.xml using VTD-XML</a></h3> <div class="excerpt">I have validLog4jXML String like this <?xml version="1.0" encoding="ISO-8859-1" ?> <!DOCTYPE log4j:eventSet PUBLIC "-//APACHE//DTD LOG4J 1.2//EN" "log4j.dtd" > <log4j:eventSet version="1.2" xmlns:log4j="http://jakarta.apache.org/log4j/"…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/java" class="post-tag grid--cell" title="show questions tagged 'java'" rel="tag">java</a> <a href="../../questions/tagged/xml" class="post-tag grid--cell" title="show questions tagged 'xml'" rel="tag">xml</a> <a href="../../questions/tagged/xpath" class="post-tag grid--cell" title="show questions tagged 'xpath'" rel="tag">xpath</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Mar 28 '14 at 15:31">asked Mar 28 '14 at 15:31</time> <a href="../../users/2638049/serge" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/2638049.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="serge" /> </a> <div class="s-user-card--info"> <a href="../../users/2638049/serge" class="s-user-card--link">serge</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">1,590</li> <li class="s-award-bling s-award-bling__gold" title="2 gold badges">2</li> <li class="s-award-bling s-award-bling__silver" title="26 silver badges">26</li> <li class="s-award-bling s-award-bling__bronze" title="49 bronze badges">49</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-22060447"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status "> <strong>1</strong> answer </div> </div> </div> <div class="summary"> <h3><a href="../../questions/22060447/how-to-remove-a-specific-node-using-vtd-xml-parser" class="question-hyperlink">How to remove a specific node using VTD-XML parser</a></h3> <div class="excerpt">With the VTD-XML parser how do I do below. <root> <A> <B> <c>1<c/> <d>2<d/> <e>3<e/> </B> <B> <c>1<c/> <d>2<d/> <e>3<e/> </B> </A> </root> In the above xml how do I…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/java" class="post-tag grid--cell" title="show questions tagged 'java'" rel="tag">java</a> <a href="../../questions/tagged/xml" class="post-tag grid--cell" title="show questions tagged 'xml'" rel="tag">xml</a> <a href="../../questions/tagged/xml-parsing" class="post-tag grid--cell" title="show questions tagged 'xml-parsing'" rel="tag">xml-parsing</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Feb 27 '14 at 06:11">asked Feb 27 '14 at 06:11</time> <a href="../../users/2771655/user2771655" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/2771655.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="user2771655" /> </a> <div class="s-user-card--info"> <a href="../../users/2771655/user2771655" class="s-user-card--link">user2771655</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">1,052</li> <li class="s-award-bling s-award-bling__gold" title="3 gold badges">3</li> <li class="s-award-bling s-award-bling__silver" title="20 silver badges">20</li> <li class="s-award-bling s-award-bling__bronze" title="38 bronze badges">38</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-21262030"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status answered-accepted"> <strong>1</strong> answer </div> </div> </div> <div class="summary"> <h3><a href="../../questions/21262030/parse-xml-file-using-vtd-xml" class="question-hyperlink">Parse xml file using vtd-xml</a></h3> <div class="excerpt">How can i parse a xml file like the following using vtd-xml in java or C ? <?xml version="1.0" encoding="utf-8"?> <StockReport> <Article Code="027783012" Height="35" Width="36" Length="136" TotalPacks="4" AvailablePacks="4" StockReturnPacks="4"…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/java" class="post-tag grid--cell" title="show questions tagged 'java'" rel="tag">java</a> <a href="../../questions/tagged/c" class="post-tag grid--cell" title="show questions tagged 'c'" rel="tag">c</a> <a href="../../questions/tagged/xml" class="post-tag grid--cell" title="show questions tagged 'xml'" rel="tag">xml</a> <a href="../../questions/tagged/xml-parsing" class="post-tag grid--cell" title="show questions tagged 'xml-parsing'" rel="tag">xml-parsing</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Jan 21 '14 at 15:15">asked Jan 21 '14 at 15:15</time> <a href="../../users/1176668/famedoro" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/1176668.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="famedoro" /> </a> <div class="s-user-card--info"> <a href="../../users/1176668/famedoro" class="s-user-card--link">famedoro</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">1,223</li> <li class="s-award-bling s-award-bling__gold" title="2 gold badges">2</li> <li class="s-award-bling s-award-bling__silver" title="17 silver badges">17</li> <li class="s-award-bling s-award-bling__bronze" title="41 bronze badges">41</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-19328840"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status answered-accepted"> <strong>1</strong> answer </div> </div> </div> <div class="summary"> <h3><a href="../../questions/19328840/recursive-printing-of-tree-structure-from-xml-having-strange-behavior-in-java" class="question-hyperlink">recursive printing of tree structure from XML having strange behavior in java</a></h3> <div class="excerpt">I'm reading in an XML file and recreating the structure for reprinting minus all attributes, values, and text. It's working fine as far as I can tell, but when I try to reprint it in the console I get very strange behavior. for( xmlNode n :…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/java" class="post-tag grid--cell" title="show questions tagged 'java'" rel="tag">java</a> <a href="../../questions/tagged/xml" class="post-tag grid--cell" title="show questions tagged 'xml'" rel="tag">xml</a> <a href="../../questions/tagged/parsing" class="post-tag grid--cell" title="show questions tagged 'parsing'" rel="tag">parsing</a> <a href="../../questions/tagged/recursion" class="post-tag grid--cell" title="show questions tagged 'recursion'" rel="tag">recursion</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Oct 11 '13 at 23:41">asked Oct 11 '13 at 23:41</time> <a href="../../users/494986/faultyjuggler" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/494986.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="FaultyJuggler" /> </a> <div class="s-user-card--info"> <a href="../../users/494986/faultyjuggler" class="s-user-card--link">FaultyJuggler</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">532</li> <li class="s-award-bling s-award-bling__gold" title="1 gold badge">1</li> <li class="s-award-bling s-award-bling__silver" title="8 silver badge">8</li> <li class="s-award-bling s-award-bling__bronze" title="29 bronze badge">29</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-17704235"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status answered-accepted"> <strong>2</strong> answers </div> </div> </div> <div class="summary"> <h3><a href="../../questions/17704235/android-xpath-evaluate-very-slow" class="question-hyperlink">Android - XPath evaluate very slow</a></h3> <div class="excerpt">I'm using XPath to query my XML-file which has at the moment about 100KB. I'm iterating of an array and query for every value in the list. Unfortunately a single query takes about 3-4 seconds under the debugger and slightly less with debugger…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/java" class="post-tag grid--cell" title="show questions tagged 'java'" rel="tag">java</a> <a href="../../questions/tagged/android" class="post-tag grid--cell" title="show questions tagged 'android'" rel="tag">android</a> <a href="../../questions/tagged/xml" class="post-tag grid--cell" title="show questions tagged 'xml'" rel="tag">xml</a> <a href="../../questions/tagged/xpath" class="post-tag grid--cell" title="show questions tagged 'xpath'" rel="tag">xpath</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Jul 17 '13 at 15:39">asked Jul 17 '13 at 15:39</time> <a href="../../users/1331671/ron" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/1331671.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="Ron" /> </a> <div class="s-user-card--info"> <a href="../../users/1331671/ron" class="s-user-card--link">Ron</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">22,128</li> <li class="s-award-bling s-award-bling__gold" title="31 gold badges">31</li> <li class="s-award-bling s-award-bling__silver" title="108 silver badges">108</li> <li class="s-award-bling s-award-bling__bronze" title="206 bronze badges">206</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-17700803"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status "> <strong>0</strong> answers </div> </div> </div> <div class="summary"> <h3><a href="../../questions/17700803/vtd-xml-for-jasper" class="question-hyperlink">VTD-XML for Jasper</a></h3> <div class="excerpt">Has anyone used VTD-XML as the XPath engine? I am unable to find any reference to this combination. This in the lines of: JasperReports fillReport too slow and resource consuming Should I create my custom implementation of JRXpathExecutor that uses…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/jasper-reports" class="post-tag grid--cell" title="show questions tagged 'jasper-reports'" rel="tag">jasper-reports</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Jul 17 '13 at 13:14">asked Jul 17 '13 at 13:14</time> <a href="../../users/362492/gsai" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/362492.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="GSai" /> </a> <div class="s-user-card--info"> <a href="../../users/362492/gsai" class="s-user-card--link">GSai</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">132</li> <li class="s-award-bling s-award-bling__silver" title="2 silver badges">2</li> <li class="s-award-bling s-award-bling__bronze" title="9 bronze badges">9</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-16724270"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status "> <strong>1</strong> answer </div> </div> </div> <div class="summary"> <h3><a href="../../questions/16724270/which-components-of-vtd-xml-are-thread-safe" class="question-hyperlink">Which components of VTD xml are thread safe?</a></h3> <div class="excerpt">Using VTD 2.11 Can VTDGen be init once and used by multiple threads? For instance I want to use it in a servlet so the VTGen gets initialized once when the servlet get initialized and then each request coming in parse what ever document is…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked May 23 '13 at 21:33">asked May 23 '13 at 21:33</time> <a href="../../users/432024/user432024" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/432024.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="user432024" /> </a> <div class="s-user-card--info"> <a href="../../users/432024/user432024" class="s-user-card--link">user432024</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">4,392</li> <li class="s-award-bling s-award-bling__gold" title="8 gold badges">8</li> <li class="s-award-bling s-award-bling__silver" title="49 silver badges">49</li> <li class="s-award-bling s-award-bling__bronze" title="85 bronze badges">85</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="mln24"> <div class="question-summary" id="question-summary-14987792"> <div class="statscontainer"> <div class="stats"> <div class="vote"> <div class="votes"> <span class="vote-count-post"><strong>2</strong></span> <div class="viewcount">votes</div> </div> </div> <div class="status answered-accepted"> <strong>1</strong> answer </div> </div> </div> <div class="summary"> <h3><a href="../../questions/14987792/unexpected-noderecorder-behaviour" class="question-hyperlink">Unexpected NodeRecorder behaviour</a></h3> <div class="excerpt">Given the following XML doc: <?xml version="1.0" encoding="UTF-8"?> <InlinePurchaseOrder> <largeMessage> <BuyersID>20031234-1</BuyersID> <IssueDate>2003-01-23</IssueDate> <LineExtensionTotalAmount amountCurrencyID="USD"…</div> <div class="grid ai-start jc-space-between fw-wrap"> <div class="grid gs4 fw-wrap tags "> <a href="../../questions/tagged/xml" class="post-tag grid--cell" title="show questions tagged 'xml'" rel="tag">xml</a> <a href="../../questions/tagged/vtd-xml" class="post-tag grid--cell" title="show questions tagged 'vtd-xml'" rel="tag">vtd-xml</a> </div> <div class="started mt0"> <div class="s-user-card s-user-card"> <time class="s-user-card--time" datetime="asked Feb 20 '13 at 18:57">asked Feb 20 '13 at 18:57</time> <a href="../../users/579400/rick-barkhouse" class="s-avatar s-avatar__32 s-user-card--avatar"> <img class="s-avatar--image" src="../../users/profiles/579400.webp" data-jdenticon-width="32" data-jdenticon-height="32" data-jdenticon-value="Rick Barkhouse" /> </a> <div class="s-user-card--info"> <a href="../../users/579400/rick-barkhouse" class="s-user-card--link">Rick Barkhouse</a> <ul class="s-user-card--awards"> <li class="s-user-card--rep" title="reputation score">1,186</li> <li class="s-award-bling s-award-bling__gold" title="2 gold badges">2</li> <li class="s-award-bling s-award-bling__silver" title="10 silver badges">10</li> <li class="s-award-bling s-award-bling__bronze" title="15 bronze badges">15</li> </ul> </div> </div> </div> </div> </div> </div> </div> <div class="s-pagination pager fr"> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=3" rel="prev" title="Go to page 3">Prev </a> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=1" rel="" title="Go to page 1">1</a> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=2" rel="" title="Go to page 2">2</a> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=3" rel="" title="Go to page 3">3</a> <div class="s-pagination--item s-pagination--item__clear">…</div> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=12" rel="" title="Go to page 12">12</a> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=13" rel="" title="Go to page 13">13</a> <a class="s-pagination--item" href="../../questions/tagged/vtd-xml_page=5" rel="next" title="Go to page 5"> Next</a> </div> </div> </div> </div> </div> <script src="../../static/js/stack-icons.js"></script> <script src="../../static/js/fromnow.js"></script> </body> </html>