How can I connect Kafka to Neo4j, or load data from Kafka into Neo4j? I saw Mazerunner and DocManager in the Neo4j documentation for connecting Neo4j with Spark and MongoDB. Are there any similar projects for Kafka and Neo4j?
Thanks.
Please check this GitHub project. It is implemented as a kernel extension for Neo4j 2.2.3.
Link: https://github.com/ducky427/neo-kafka
You may need to change the settings code to use it with any 3.0.x version.
Alternatively, if you are using the embedded version of Neo4j, you can integrate the two through your own custom API.
Do share if you find other ways to integrate.
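For the custom API route, the core of the integration is turning each Kafka record into a parameterized Cypher statement. Here is a minimal sketch of that step, assuming JSON-encoded message values with a hypothetical `id` field and a hypothetical `Person` label. The Kafka consumer and the Neo4j driver wiring are deliberately left out, since they depend on the client libraries you choose.

```python
import json

# Hypothetical label and key field; adjust both to your own data model.
LABEL = "Person"
KEY_FIELD = "id"


def message_to_cypher(raw_value: bytes):
    """Turn one JSON-encoded Kafka message value into (query, params).

    MERGE keeps the write idempotent, so a re-delivered message
    updates the existing node instead of creating a duplicate.
    """
    record = json.loads(raw_value.decode("utf-8"))
    if KEY_FIELD not in record:
        raise ValueError(f"message has no '{KEY_FIELD}' field: {record!r}")
    key = record[KEY_FIELD]
    # Everything except the key becomes a node property.
    props = {k: v for k, v in record.items() if k != KEY_FIELD}
    query = (
        f"MERGE (n:{LABEL} {{{KEY_FIELD}: $key}}) "
        "SET n += $props"
    )
    return query, {"key": key, "props": props}
```

Each `(query, params)` pair would then be handed to whatever session or transaction object your Neo4j client exposes. Passing values as parameters rather than formatting them into the query string avoids quoting problems with arbitrary message content.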
I have downloaded Gremlin Server with an intention of being able to use Gremlin to traverse a Neo4j DB.
Now, speaking of the latter, it has to be added to the Gremlin Server installation somehow, but I have difficulty finding any up-to-date guidance on how to do that. There are a few posts here on SO describing various problems people run into, but no definitive solution, much less one for the current versions of both TinkerPop and Neo4j.
Would appreciate specific links, tips etc.
Thanks!
There is a "TIP" describing Gremlin Server configuration in the TinkerPop reference documentation. Basically, you install the Neo4j dependencies:
bin/gremlin-server.sh install org.apache.tinkerpop neo4j-gremlin 3.3.4
Then you edit your Gremlin Server YAML configuration file to connect to your database. Gremlin Server ships with a sample file to get you started, found in the conf directory of the installation. Of critical note is this entry:
graphs: {
  graph: conf/neo4j-empty.properties}
It specifies the Neo4j configuration to use and the sample one that ships with Gremlin Server looks like this:
gremlin.graph=org.apache.tinkerpop.gremlin.neo4j.structure.Neo4jGraph
gremlin.neo4j.directory=/tmp/neo4j
gremlin.neo4j.conf.dbms.auto_index.nodes.enabled=true
gremlin.neo4j.conf.dbms.auto_index.relationships.enabled=true
As you can see, the configuration basically just passes Neo4j-specific settings through to Neo4j itself. Only the first two lines are TinkerPop options. In this case, it sets up Neo4j in embedded mode, meaning Neo4j runs within the Gremlin Server JVM. You can also make Gremlin Server part of a Neo4j HA cluster by following the instructions in the reference documentation.
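To make the pass-through concrete, the split between TinkerPop options and Neo4j options in that sample file can be sketched as follows. This is an illustration only: the helper `split_neo4j_properties` is hypothetical and is not how TinkerPop itself reads the file. It assumes that keys under `gremlin.neo4j.conf.` are forwarded to Neo4j with that prefix removed.

```python
# The sample properties file quoted above.
SAMPLE = """\
gremlin.graph=org.apache.tinkerpop.gremlin.neo4j.structure.Neo4jGraph
gremlin.neo4j.directory=/tmp/neo4j
gremlin.neo4j.conf.dbms.auto_index.nodes.enabled=true
gremlin.neo4j.conf.dbms.auto_index.relationships.enabled=true
"""

# Assumption: settings under this prefix are handed to Neo4j unchanged.
PASS_THROUGH_PREFIX = "gremlin.neo4j.conf."


def split_neo4j_properties(text):
    """Return (tinkerpop_options, neo4j_options) from properties text."""
    tinkerpop, neo4j = {}, {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, _, value = line.partition("=")
        key, value = key.strip(), value.strip()
        if key.startswith(PASS_THROUGH_PREFIX):
            # Strip the prefix to get the native Neo4j setting name.
            neo4j[key[len(PASS_THROUGH_PREFIX):]] = value
        else:
            tinkerpop[key] = value
    return tinkerpop, neo4j
```

Run against the sample, this puts `gremlin.graph` and `gremlin.neo4j.directory` on the TinkerPop side and the two `dbms.auto_index.*` settings on the Neo4j side, which matches the "first two lines" remark above.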
Note that you asked for "current" versions of both TinkerPop and Neo4j. While these instructions are current for TinkerPop, I'm afraid that the Neo4j version TinkerPop supports is well behind their latest release. It would be nice if someone had time to issue a pull request for that.
I intend to use the Neo4j-community database in version 3.4.5 with Gremlin commands.
I have tried several plugins, but none of them work with the later versions of Neo4j.
Can someone outline how I can resolve my requirements?
As of right now, TinkerPop's neo4j-gremlin is built to work with Neo4j 3.2.3. That dependency is set via:
https://github.com/neo4j-contrib/neo4j-tinkerpop-api-impl/blob/0.7-3.2.3/pom.xml#L23
To see an upgrade, that library would need to be bumped to the version of Neo4j that you want and then neo4j-gremlin would need to be updated in TinkerPop:
https://github.com/apache/tinkerpop/blob/3.3.3/neo4j-gremlin/pom.xml#L120
Obviously I don't know if there would be required code changes in either repository for this to work.
I am new to neo4j-gremlin. I have downloaded the latest version of TinkerPop (tinkerpop-3.3.1) and neo4j-community-3.3.3 on a Windows machine.
I followed the installation instructions for Neo4j-Gremlin on the TinkerPop site and was able to install the neo4j-gremlin plugin, but when I try to connect to the Neo4j graph.db using the Gremlin Console (graph = Neo4jGraph.open('C:/neo4j/data/databases/graph.db'))
I get the following error. The option to display the stack trace is not available either; it jumps directly back to the gremlin prompt. Any thoughts?
Error instantiating Neo4j Database for C:/neo4j/data/databases/graph.db
I was able to bring up this setup with Gremlin Server running the Neo4j plugin. You can look at my GitHub project https://github.com/achandak123/gremlin-orm-neo4j. Don't worry about the README mentioning version 2.3 and so on. This works with TinkerPop 3.3.1 and the Neo4j plugin.
I'm going to try the Neo4j Enterprise embedded database. Right now I can't find any tutorials that explain how to configure and use the Neo4j Enterprise embedded database together with Maven and Spring.
In order to use embedded Neo4j Enterprise, should I download the Neo4j zip archive and start it, or should the Maven dependency be enough? I don't understand how the Java configuration relates to, for example, the downloaded distributions.
Could you please tell me the proper way to use the Neo4j Enterprise embedded database together with my application?
If you really need to embed Neo4j into an application, you most likely just need the core and kernel APIs, which are under the org.neo4j:neo4j dependency, not the enterprise dependencies (org.neo4j:neo4j-enterprise).
Your maven config would have the following dependency:
<dependency>
    <groupId>org.neo4j</groupId>
    <artifactId>neo4j</artifactId>
    <version>NEO4J VERSION HERE</version>
</dependency>
Your question makes me think that you may really want to just run Neo4j as a server and connect to it via REST or any of its Bolt drivers. You don't download and start the embedded version; you do that with the server. Can you clarify exactly what you want to do?
Just add a neo4j dependency on org.neo4j:neo4j-enterprise:<version>
If you don't need HA that's all you need.
I'm a beginner with databases and I want to deploy Neo4j on Google Cloud Platform.
I can find something about deploying MongoDB on Google, but nothing about neo4j.
So I wonder: does Google Cloud support Neo4j?
Thanks!
Neo4j is an open source project that can run on your own Linux machine.
You could just create a Google Compute Engine instance and follow one of the tutorials on the web to set up Neo4j, such as the Neo4j setup instructions.
Just follow the Linux part. I suggest using a Debian image to create the instance for Neo4j, because the command-line tools on Debian are the most similar to Ubuntu's.
Updated answer from 2018.
Yes, Neo4j supports Google Cloud. Instructions can be found on their website. You can use a pre-built image to launch a single-node instance or a multi-node cluster on GCP.