XBRL Generator(Mapping tool) - mapping

I am now developing an accounting web application to generate XBRL file then post to a government host web service. Now I have taxonomy prepared and user data ready. What I need is a software which can quickly map those data base on the taxonomy and generate XBRL files. Could anyone provide me some software which can achieve this? Open source is the best that I can implement into my project. Thanks in advance.

XBRL is a broad area, from detailed accounting data (XBRL GL) to summary reports to highly dimensional, data warehouse type content.
Commercially, products like Altova MapForce are really good for mapping from underlying data sources to XBRL taxonomies and creating content.
I assume you have already looked through SourceForge and found tools like Gepsio (https://gepsio.codeplex.com/). Arelle is a great start for people who know Python. (www.arelle.org). And there is a ton of Open Source and XBRL at xbrlwiki.info.

Related

Implementation of articulate

I am looking at Articulate to support my own application. I want my users to be able to use Articulate to create the E-learning and then uploading the files to my site and run it on my site.
However i wish to store both the questions and the answers of the quiz that is created in articulate.
Does anyone know how this is possible or know a way around either by using excel or any other software to store the Q/A?
There may be more than one answer. But Articulate is SCORM compliant. It is designed to return test scores to the server using SCORM. I have not been involved with receiving those test scores by acting as an LMS, but in the past I have made custom applications that send that data to the LMS using scorm. But if you research SCORM and other APIs they use you might find something useful.
For example, if you mean Articulate Storyline then you can test send/receiving data using this package:
http://www.articulate.com/support/storyline/how-to-use-the-scorm-test-suite
This shows that it can be done. But you would have to write your own program to communicate with Storyline. If you can duplicate how that package works you might be on your way.
As an alternative to SCORM, you can also use the xAPI see https://xapi.com/overview/ to store your e-learning data

Human annotation tool for corpora in NLP [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I am trying to build my own training corpus for Named Entity Recognition, but I don't know if there is already an existing tool for this or if I have to implement one myself.
Basically, what I need to do is take a corpus and manually tag it word by word, which is pretty tedious, but it has to be done.
Can anyone tell me if there is already an existing one and where to get it?
I had a good experience working with BRAT.
GATE is also a very complex tool for annotating, steeper learning curve.
We had a nice experience using DataTurks . They provide nice intuitive UI which allows to add collaborator, insights into data, leaderboard for annotators and some other funky features.
https://dataturks.com
For online annotation of text or HTML corpus of relatively short documents I also recommend BRAT. You will have to go under the hood of the python web application if you want to do anything custom. It also failed to work for me on large HTML documents (100 or so pages).
I have also used stand-alone apps:
Protege + Knowtator: a bit cumbersome to setup / use, but it
works;
Gate: also cumbersome, and it somewhat works. Backup
your annotations at regular intervals as you might get
surprised by a stacktrace that also wiped or corrupted your annotated
corpus (which is just serialized Java objects).
If you are dealing with PDF documents, we built a web-based PDF Annotation Tool: NOTA. It accepts anything printed to PDF, including scans. We do commercial OCR on our end to recover text from images. There is a REST API to create color-coded annotation schemas and pre-populate documents with annotations, as well as a REST API for exporting formatted text and annotation offsets. There is also a JS API you can use to customize any annotation workflows, add metadata to annotations, etc. Relationships are not supported out of the box. Large documents, 200+ pages are supported. Email us and we can give you an API key to try it out. Details and documentation links can be found here. It is free for small research projects.
Here is a screenshot of what the annotations looks like :
I co-develop myself the web-based text annotation tool: tagtog.net
There is nothing to install, and you can define the type of entities you want to annotate. Additionally you can annotation relationships, document labels, and much more. You can upload your documents in many different formats, including PDF or markdown. You can annotate together with your team collaboratively. We have put great care in making the interface easy and beautiful. It looks like this:
You can start right away with a free account. Also I would be happy to help you with any doubt or issue you may have; just ping me or write us an email to the address shown on the website, tagtog.net.
Our annotation tool Prodigy is very scriptable, and is designed for active learning. It integrates especially well with our NLP library spaCy.
We've paid particular attention to the Named Entity Recogntion (NER) annotation workflows, as entity recognition can otherwise be very slow. I have a tutorial video on this:
https://www.youtube.com/watch?v=l4scwf8KeIA
There is this tool called, Dataturks is super simple to use, fully online NLP annotation tool, so that I even can easily push my teammates to complete datasets for our projects.
try TagEditor ,
It is a desktop application designed to annotate text for training with spaCy library.
You can tag Named Entities, Dependencies, Parts of speech, text categories
and print json file.
Example

OneNote like web clipping software for notes, code examples, and articles on Mac OS?

As a developer I find I am gathering more and more information from blogs and other resources from the web. Whether it be tips on configuring Drupal on IIS7 or tips on using the Entity Framework I find I am looking for a way to capture and organize content from the web. I also would like to be able to edit and annotate content to be able to add my own notes and remove add banners or any other content not related to what I am capturing.
When I used Windows OneNote seemed to fit the bill but I have recently moved to Mac OS and I am looking for an equivalent software package. I could run OneNote in a VM but would prefer to have a Mac OS native app. Here are some of the things I am looking for ..
Native app rather than web based. Because a web based product could go out of business and my collection could be lost.
Ability to organize and handle a large amount of data.
Good web clipping ability. So much of my content comes from the web.
Thanks for any suggestions!
I figured I would answer my own question with information on what I found. There ws no shortage of good apps on the Mac for note taking, web clips, and information storage.
Native Mac OS apps
DEVONthink (http://www.devon-technologies.com/products/devonthink/)
This is the application I decided to go with. It is expensive ($150 for Pro Office) but I really liked how it used the file system as it's storage medium and not a single database file. The fact that it has an nice iPhone and iPad app (DEVONthink 2 Go) make it my number one choice. Tagging and folder hierarchy was something I liked and really nice search capabilities. Also, built in OCR.
YoJimbo - http://www.barebones.com/products/yojimbo/
Very nice application with nice interface and nice reviews. I just didn't like how all content was saved to a single database file. Nice iPad app also.
Eagle Filer - http://c-command.com/eaglefiler/
Very similar to DEVONthink (minus the OCR) but the price was very affordable and it used the file system to store files in native format. I would have chose Eagle Filer if it had a companion iOS app.
Together - http://reinventedsoftware.com/together/
I thought Together had a really nice interface. I thought it was very similar to Yojimbo but no companion app (which YoJimbo has)
Curio - http://www.zengobi.com/products/curio/
This was an awesome (but expensive) application. In the end I found it to be more suited to creating content rather than storing it. I might look into this as a solution to brainstorming and content creation and use something like DEVONthink to store the content. Very generous trial period.
VooDooPad - http://flyingmeat.com/voodoopad/
VooDooPad got a lot of nice reviews. However, I wasn't too fond of the interface.
Circus Ponies Notebook - http://www.circusponies.com/
I personally didn't like the interface of Circus Ponies Notebook however this is a subjective thing. I did not like how a clipping service had to be created in order to import content.
Web Based Tools
Though I prefer a solution that ran as a native Mac OS app, I came across some nice web based applications.
ZoHo Notebook - http://notebook.zoho.com
Mnemonic - http://www.memonic.com/home
SpringPad - http://springpadit.com/home
Evernote - http://www.evernote.com
UberNote - http://www.ubernote.com/webnote/pages/default.aspx
MediaWiki - http://www.mediawiki.org/wiki/MediaWiki

Simple Geographic Data Sources for a Web App

For a web app I'm working on, I need to know the lat/lon of about 300 US cities. I also need to know the lat/lon for every US zip code.
Does anyone know of a free source to get this information?
Freebase will likely be able to help you. Here is New York and it has Lat/Long.
The Wal-Mart Expansion video used Freebase to get it's data, and it sounds like you're doing something similar.
Take a look at Ben Fry's Zip Code Visualizer. This is part of his book Visualizing Data.
There is an online database at Geocoder.us. And a simple perl module (we added a soap service to this so we could simply access it from any language, and the longest part of the development process was downloading database (it's about 4 GB).) The geocoing database is available from the US census bureau, a good article on getting this setup can be found here.

TIGER shapefiles - using and interpreting

I know of the US GIS TIGER file format from years ago, but have never used it.
I'm very shortly going to need to very quickly implement simple geocoding and vector graphics of roads and other features.
Where do I go for information - are there tutorials, example queries, etc?
Are there other ways to include geocoding and basic mapping in a mobile (no internet) device?
-Adam
As far as I'm aware of, there aren't many applications that make use of the TIGER/Line format directly. Most apps use TIGER files that have been translated into ESRI's shapefile format.
Edited to add:
Is there information on ESRI's format available?
There's an ESRI whitepaper describing the file format.
If you're planning to use shapefiles in an application, there are various libraries out there.
The OpenStreetMap project imported TIGER data, you might find useful code snippets there. See the TIGER page on the OpenStreetMap wiki for more information and links

Resources