Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I am currently designing a low level network serialization protocol (in fact, a refinement of an existing protocol).
As the work progress, pen and paper documents start to show their limits: i have tons of papers, new and outdated merged together, etc... And i can't show anything to anyone since i describe the protocol using my own notation (a mix of flow chart & C structures).
I need a software that would help me to design a network protocol. I should be able to create structures, fields, their sizes, their layout, etc... and the software would generate some nice UMLish diagrams.
Sorry to say, everything I've seen so far (various serial protocols for embedded devices/networks) has used Word documents, with plain old tables showing allocations of fields to the bytes in the message. Alternatively, I've seen it done in Excel documents! It works, and people can read it.
Unfortunately, that's not helpful for automatic code generation, unless you have a very strict format in e.g. an Excel doc that you can then parse with a tool to generate some code. It would be good to have a notation that can be easily machine parsed, as well as human readable.
For showing message handshaking and sequences, a UML sequence diagram is good of course. There are lots of tools readily available to help you with that part of it.
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I am working on a project where I need to extract "technology related keywords/keyphrases" from text. For example, my text is:
"ABC Inc has been working on a project related to machine learning which makes use of the existing libraries for finding information from big data."
The extracted keywords/keyphrase should be: {machine learning, big data}.
My text documents are stored as BSON documents in MongoDb.
What are the best nlp libraries(with sufficient documentation and examples) out there to perform this task and how?
Thanks!
It looks you need to narrow down more than just keywords/key phrases and find the subject and object per sentence.
For subject/object recognition, I recommend the Stanford Parser or the Google Language API, where you send a string and get a dependency tree response.
You can test the Google API first to see if it works well with your corpus: https://cloud.google.com/natural-language/
The outcome here is a subject predicate object (SPO) triplet, where your predicate describes the relationship. You'll need to traverse the dependency graph and write a script to parse out the triplet.
Other Packages:
I use NLTK, Spacy, and Textblob frequently. If the corpus is simple, generic, and straightforward, Spacy and Textblob work well OOTB. If the corpus is highly customized, domain-specific, messy (incorrect spelling or grammar), etc. I'll use NLTK and spend more time customizing my NLP text processing pipeline with scrubbing, lemmatizing, etc. You may want to add your own custom dictionary of technology related keywords and keyphrases so that your parser can catch these if you decide to go with one of these packages.
NLTK Tutorial: http://www.nltk.org/book/
Spacy Quickstart: https://spacy.io/usage/
Textblob Quickstart: http://textblob.readthedocs.io/en/dev/quickstart.html
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 2 years ago.
Improve this question
has anyone had any experience with ISOBUS Iso-XML (ISO 11783-10)?
I would like to convert some geojson and/or shapefiles (.shp) to this format but am having trouble finding any information.
The specification of ISO 11783-10 is not free. You can buy the specification (google for "ISO 11783-10"), the documentation is high quality and affordable (I'm not affiliated in any way). The bought documents are distributed with some "license" and will be marked with your name, so buyers will be reluctant to share them. There are many concepts involved and details are cryptic, therefore you really need the specification to work with this format.
If you write source code encoding/decoding this format, it will depend on the specification. Distributing your source code + documentation might or might not infringe on the IP/license (I'm not a lawyer). Publishing detailed responses on this subject in open fora on the internet might or might not infringe.
There are quite some developers writing code which deals with ISO 11783-10, but I think the scarcity of freely available source code (e.g. on github) and open discussions on internet is a direct effect of the specification publication policy. It's sad because this domain is complex, has high impact, has many details which are very instance-specific (e.g. for a specific vehicle-manufacturer, accidental complexity), so open discussions and source code could improve productivity for many.
Agleader software SMS Basic has the option to export shapefiles to ISOBUS format.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 6 years ago.
Improve this question
The task is to process a lot of commands that needs to be saved in some kind of stack or buffer.
While one method will push the data in, there will be multiple threads or processes, that will take thees tasks one by one and process them.
Right know the Idea is to save the tasks in buffer that uses NoSQL database, so we can get object and delete it simultaneously.
I'm thinking, that for this problem probably is already solution with some kind of server or library, that handles task processing and distribution between multiple instances.
Is there such a thing?
Well the pattern implementation depends on your specific needs. Your question is too general for a better answer than the one provide as a comment by #AljoshaBre: "this is a classic - producer/consumer problem. look it up on the webz.". If you look at the wikipedia article about producer/consumer problem, you can find the pattern implementation in Java - the general pattern is small, but to address your specific needs more details are required. You say something about "task processing and distribution between multiple instances", and it leads for a more specific architectural pattern called Distributed Message Queues (some random ref). There is the Apache project ActiveMQ that aims to implement such pattern.
I found gearman, php queue manager, and solution with nodejs node-amqp and php-amqp using RabbitMQ
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Can anyone direct me to a free people tracking library? I would like to be able to use an input image (or video) and get the position of people in it. I have looked at the Reading People Tracker but cannot compile it! I am on Linux (Ubuntu) but windows would be okay (if anyone can tell me how, this would be great). Preferably, it would be for C/C++ but java, c#, ruby and python would be okay too. Thanks in advance, ell.
Not specifically a people tracking library, but as a general tracking approach "Predator" is very highly regarded: http://info.ee.surrey.ac.uk/Personal/Z.Kalal/tld.html - Authors site includes some very impressive demonstration of results on youtube
This article fully explains an algorithm used for tracking moving people, and the accompanying source code is here; it is pure Java. You can see it in action in this video.
(Disclaimer: I'm the author; but I do think this is very useful, and have successfully used the algorithm a lot myself.)
The algorithm tracks moving objects in general, finds their bounding rectangle (which the application draws), counts the number of pixels in each objects, and consistently assigns them the same integer ID throughout the video frames.
When it comes to commercial computer vision applications, OpenCV and the Point Cloud Library aka PCL are your best friends (C++, but there are Java and C# bindings). And articles like the one linked explains how to use tools like OpenCV to accomplish full stack motion tracking. (The pure Java implementation shows how it works down to the individual pixels.)
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 7 years ago.
Improve this question
Given n number of raw URLs, I'd like to be able to classify them by:
news, blog, photo and video.
An example would be if a link directs a user to a photo, would it be enough to say that the raw link contains file extension for images to be able to classify the raw URL as photo?
As for video, blog and news, it seems it isn't enough to have a set of domains (like http://www.youtube.com) that will classify the raw URLs.
Could classification be done by examining the web content? Or are there any open source tools for this?
The only URLs that may be even somewhat reliably classified, are those that point to a distinct medium (i.e. http://foo.com/foo.jpg is most certainly an image). Otherwise, you must analyze the content of the page.
This can be a bit tricky, as Flash may contain a photo, video, or neither, without providing any searchable clue as to the content of the flash object. With enough effort, this can obviously be overcome (Google does it!), but I'm not aware of any open source resources that provide a library of media-related domains. Such data result from countless programmer-hours of effort -- an effort that typically seeks a return on investment (ROI). Case in point, ClueWeb09 is just a dataset of downloaded pages, used to test search algorithms -- not really sorted or categorized.
"Sometimes no help is the answer."