The XPath for plain text extraction should be user configurable #14

nz · 2016-05-07T16:40:02Z

Currently, we look for all text under an article element with //article//text() — this should probably be configurable as it will vary from layout to layout.

The text was updated successfully, but these errors were encountered:

drusellers · 2017-03-26T13:08:12Z

A+

and use a data- attribute by default ie <div data-searchyll-content> type approach so that users can choose anything they want

drusellers · 2017-03-26T15:03:31Z

//article[@data-searchyll='true']/text()

<?xml version="1.0" encoding="UTF-8"?>

<div> 
  <article data-searchyll="true">abc</article> 
</div>

nz added the 1 - Ready label May 7, 2016

nz added this to the 1.0.0 milestone May 7, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The XPath for plain text extraction should be user configurable #14

The XPath for plain text extraction should be user configurable #14

nz commented May 7, 2016 •

edited

Loading

drusellers commented Mar 26, 2017

drusellers commented Mar 26, 2017

The XPath for plain text extraction should be user configurable #14

The XPath for plain text extraction should be user configurable #14

Comments

nz commented May 7, 2016 • edited Loading

drusellers commented Mar 26, 2017

drusellers commented Mar 26, 2017

nz commented May 7, 2016 •

edited

Loading