Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The XPath for plain text extraction should be user configurable #14

Open
nz opened this issue May 7, 2016 · 2 comments
Open

The XPath for plain text extraction should be user configurable #14

nz opened this issue May 7, 2016 · 2 comments
Milestone

Comments

@nz
Copy link
Member

nz commented May 7, 2016

Currently, we look for all text under an article element with //article//text() — this should probably be configurable as it will vary from layout to layout.

@nz nz added the 1 - Ready label May 7, 2016
@nz nz added this to the 1.0.0 milestone May 7, 2016
@drusellers
Copy link
Contributor

A+

and use a data- attribute by default ie <div data-searchyll-content> type approach so that users can choose anything they want

@drusellers
Copy link
Contributor

//article[@data-searchyll='true']/text()
<?xml version="1.0" encoding="UTF-8"?>

<div> 
  <article data-searchyll="true">abc</article> 
</div>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants