Ignoring Header/Footer text when using TIKA - apache-tika

I'm using IKVM in order to use the TIKA library in a .NET application. I'm able to extract text but now I want to tell TIKA that I do NOT want the Header/Footer information. TIKA case TIKA-906 shows that the latest version now includes the header/footer text, but does not show how to exclude it.
I'm pretty much using the same code outlined here.
Any help would be greatly appreciated.

Related

How to add header/footer with dynamic pages using jsPDF without using fromHTML

I am not using fromHTML, as it has limited modification options. I've also included image in header with base64 which is working fine.
My concern is with header and footer which should be on all pages of PDF. I don't want to use addPage() method as PDF's data will be dynamic.
Really in need for the answer. Much appreciate the support, thanks.

How to print .docx,doc,xls,csv in rails 4?

How can I print .docx,doc,xls,csv in rails 4?
I tried window.open and window.print using javascript, but each time when it invokes window.open it downloads the file.
I tried to embed that in an iframe, that also fails.
Displaying a specific type of file isn't something that relies on your application it relies on the browser. In common browsers there is no support to display the file formats you mentioned.
What you can do is implementing some document viewers based on js if they are available for your usage. For example I know there is a plugin viewer.js that provides functionality to convert office and pdf to html. That might help you out... https://developers.box.com/viewer-js-tutorial/
I tried doing this some time back and this is the blog which i refereed. Helped a great deal.
Hope this helps.

Display bar graph using export plugin in grails

I have sample data which I want to show in bar/pie graph using export plugin in grails. I googled a lot but was unable to get exact details of how to do it?
Can anybody please provide me links or any information to start off with this? Thank you.
You didn't provide much description about exactly what your use case is but there are a number of charting plugins for Grails mentioned in this question:
Grails and charting?
These plugins would enable you to render charts on your page. Then there's the issue of converting to PDF. Is it acceptable for users to print the web page to PDF using the browser's functionality? If not, again there are plugins for converting GSPs to PDF (such as http://grails.org/plugin/pdf).
Hopefully you should be able to do what you want with a combination of plugins.

Menu dropdown messed up after translating page using Google translator

My menu drop-down is now messed up, after I translated the page using on-the-fly Google translater. It refuses to recognize CSS classes I have created and used for exclusion (e.g, I have a class like this class="notranslate", which is no no longer working).
For demostration purpose, I have source code
here
Note: Must register to the site first. Any help would be appreciate it. Thanks
Not sure what is going on but look at this tool: Translate This
This uses the Google Translate but is easier to use.
Hope this can help you.

docbook wysiwyg editor for the web

Is there any ready-to-go docbook editor for web applications out there?
I'm thinking of something like FCKeditor or TinyMCE that returns a docbook xml code back to the web application. I am aware that FCKeditor is customizable to do this, but I have not heard of anyone who did this.
Maybe the Xopus web based XML editor does the trick?
http://xopus.com/xopus-web-based-wysiwyg-xml-editor.html
It has support for Dita and Docbook.
I think you are looking for an online docbook IDE or an editor that automates a lot of the XML rigmarole associated with the docbook format.
You could try the oXygen editor http://www.oxygenxml.com/xml_editor/docbook_editor.html . Their trial is 30 days to see if you like it.
Hopefully that gives you some new google terms as well, but that is all I have.
(If, in addition to a good tool for what you need, you find a nice graphical validator. I would love to see something that works better for my work flow than command line jing.)
Docbook is one of the file formats supported for single-source editing in Open/ Libreoffice.
Note that there can be no such thing as a WYSIWYG editor for a presentation-neutral document representation: to be WYSIWYG, things like page dimensions and margins need to be fixed.

Resources