It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 11 years ago.
Im going to make my own search engine.
When searching about search engine, crawler, and so on, I confused about Nutch.
I don’t understand what is Nutch. Is it for internal use like Lucene (correct me if Im wrong) or a framework for creating a search engine (example:google, bing, yahoo)?
Nutch is a full featured search engine - it can crawl external web sites, and it understands and respects robots.txt.
http://nutch.apache.org/about.html
Overview Nutch is open source
web-search software. It builds on
Lucene and Solr, adding web-specifics,
such as a crawler, a link-graph
database, parsers for HTML and other
document formats, etc.
Nutch can run on a single machine, but
gains a lot of its strength from
running in a Hadoop cluster
The system can be enhanced (eg other
document formats can be parsed) using
a plugin mechanism.
For more information about Nutch,
please see the Nutch wiki.
Nutch is a ready-made, configurable web crawler with a Java Servlet for performing searches. If you wanted to do this as a project, Nutch probably does too much since all that's left is creating the pages for entering searches and displaying results.
Related
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 9 years ago.
I want do use some of my own converter from html table to xls table, but I don't know where to start. The google don't show me comprehensive results. I know about Apache tika and poi, but do they have something easy to build converter? I used to read POI docs but it's just says about converting from xls-to-html most of time. What you will suggest to read. Where to search? Thank you.
It's a two-step process. I'd advise you to keep them separate.
Scrape and parse HTML to get table data
Write table data into Excel.
If the HTML is XHTML your life gets better: All you need is an XML DOM parser and some code to find the node at the root of the tree with the data.
I prefer Andy Khan's JExcel to POI. I think it's far better for dealing with Excel.
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 10 years ago.
I just started learning Erlang. My task to write a simple script for testing web applications. I hasn't found work script in the Internet, and Tsung too bulky for such a task. Is anyone can help me (give working example of script or link where I can found it)?
What would be possible to specify a URL, and concurrency, and time of testing and get the results. Thanks.
This links not help:
http://effectiveqa.blogspot.com/2009/12/minimal-erlang-script-for-load-testing.html
(not working, function example/0 undefined )
http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1
(work for socket, but I need concurrent testing)
I use for such purposes basho bench. It not so hard to start with it and add your own cases. Also it contains script, which draw all results.
Would like to build one? I would not recommend that way (because I have tried and there are so many things to consider to build one, especially spawning many processes and collecting the result back)
As you already know, I would recommend tsung, although it is bulky, it is a full load test application. I have gave up mine, and went back to tsung because could not properly handle opening/closing sockets with too many processes.
If you really want a simple one, I would use httperf. AFAKI, it works fine with single machine with multiple processes.
http://agiletesting.blogspot.ca/2005/04/http-performance-testing-with-httperf.html
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 11 years ago.
what are the best practices to process images in enterptices web applications.
I mean
storing
assign with entity
fast loading/caching
delayed / ajax loading
suitable format (png, jpeg)
on fly editing (resizing, compress)
free libs/helpers
image watermarking/copyrighting on fly
Especially, appreciated already production approaches!
As always, every project has their own requirements, restrictions and resources (The 3Rs). There is no 'super pattern' or 'one size fits all' method.
We cannot tell you how to implement you project as every project is different. It's up to you to use your skills/knowledge and experience to make informed decisions on implementation.
The 'best practice' is to individually research and learn each of the technologies/methods you have listed and gain the knowledge to know when to use them based on your projects requirements, restrictions and resources.
I use ImageMagickObject in my mvc projects. It can:
suitable format (png, jpeg)
on flyediting (resizing, compress)
freelibs/helpers image
watermarking/copyrighting on fly
fast loading/caching: may be memcached?
delayed / ajax loading: jquery is a good solution
assign with entity: Entity Framework can work with almost all databases
storing: hard question. all depend to the functionality
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 9 years ago.
I'm getting started in developing web services using JAX-WS. I'm trying to implement classes I can send between my service and client using JAXB, but having trouble getting it to work.
I've tried following the example at this site but cannot seem to get it to work. After following the instructions, the test page that's displayed doesn't have any way of defining the Die objects as is suggested by the article.
All of this is very new to me, so I'm really not sure where to go. I've checked the WSDL file linked from the test page, and my best guess is that the JAXB is not working properly as I don't see the Die class or its properties mentioned anywhere.
A good, fully worked example would be very beneficial. Does anyone know where one can be found? If it's any help, I'm working in Netbeans.
Thanks!
Here is a step by step how to for jax-ws.
http://netbeans.org/kb/docs/websvc/jax-ws.html
If you google netbeans web services you will find many examples.
Here is an excellent JAX-WS tutorial for beginners. http://java.globinch.com/enterprise-java/web-services/jax-ws/java-jax-ws-tutorial-develop-web-services-clients-consumers/
The tutorial explains how to develop web service and consumer without using a Java EE container.
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center.
Closed 13 years ago.
I've been learning Groovy & Grails recently, and in terms of developer productivity it seems to be light years ahead of other Java solutions (Spring, Struts, EJB, JSF). If I search monster.ca, for either Groovy or Grails, 0 matches are returned, which suggest Grails isn't doing too well in terms of adoption.
I realise that:
Grails is relatively new and adoption takes time
Success of a technology depends on more than just it's technical merits (e.g. marketing $)
Search results on monster.ca are at best a very rough proxy for global adoption. It's possible that lots of people are using it, just not in Canada, or Canadian companies that are using it simply aren't hiring at the moment
Are there other reasons why it hasn't been adopted to the extent it seems to "deserve"?
There are probably more people using Grails than you think. Job boards show you what are the skills people are looking for. Grails is fairly new and there are not a lot of people experienced with it out on the job market.
Grails and in particular Groovy are very close to Java. A few quick lessons in Groovy and a Java developer and quickly feel at home. You can very easily take a vanilla Java developer posting and put that person on into a position developing with grails.
I would say that you will see more Groovy/Grails postings in the future as more Java shops adopt these technologies.