代码人生的小狗窝

一行行枯燥的代码,却描绘出人生的点点滴滴

您现在的位置是:首页>_开源软件

Tinkerpop + JanusGraph + Hbase容易实现

发布时间:2018-10-17浏览(3087)

    Tinkerpop + JanusGraph + Hbase简单实现
    Tinkerpop + JanusGraph + Hbase简单实现
    〇、机器环境说明
    192.168.1.2 master 安装配置好jdk1.8;拥有一套正常运行的Hadoop+Hbase+Zookeeper集群. ... 集群其他信息自定义.
    注:文档中的主机IP,主机名称,安装目录仅供参考,请以实际环境进行调整。
    Apache TinkerPop™是图形数据库(OLTP)和图形分析系统(OLAP)的图形计算框架。
    JanusGraph是一个可扩展的图形数据库,专门用于存储和查询包含数千亿个分布在多机群集中的顶点和边缘的图形。JanusGraph是一个事务数据库,可以支持数千个并发用户实时执行复杂的图形遍历。
    一、Hadoop+Hbase+Zookeeper的集群搭建
    此过程虽不麻烦, 但一时讲清楚, 还是很费时间的, 在这就省略了.
    二、创建目录, 下载Tinkerpop
    根据JanusGraph官方文档<<Table B.1. Version Compatibility Matrix>>提供的信息, 在此次搭建中, 采用Tinkerpop 3.2.7, JanusGraph 0.2.0, 相关地址http://docs.janusgraph.org/latest/version-compat.html.
    1. 创建目录
    [[email protected] ~]# mkdir -p /usr/local/program
    [[email protected] ~]# cd /usr/local/program/
    [[email protected] program]# pwd
    /usr/local/programe
     2. 下载Tinkerpop
    [[email protected] program]# wget http://www-eu.apache.org/dist/tinkerpop/3.2.7/apache-tinkerpop-gremlin-console-3.2.7-bin.zip # console, 相关地址http://tinkerpop.apache.org/.
    [[email protected] program]# wget http://mirror.bit.edu.cn/apache/tinkerpop/3.2.7/apache-tinkerpop-gremlin-server-3.2.7-bin.zip # server,相关地址http://tinkerpop.apache.org/.
     3. 解压Tinkerpop
    [[email protected] program]# unzip apache-tinkerpop-gremlin-server-3.2.7-bin.zip
    [[email protected] program]# unzip apache-tinkerpop-gremlin-console-3.2.7-bin.zip
    [[email protected] program]# ls -1
    apache-tinkerpop-gremlin-console-3.2.7
    apache-tinkerpop-gremlin-server-3.2.7
    apache-tinkerpop-gremlin-console-3.2.7-bin.zip
    apache-tinkerpop-gremlin-server-3.2.7-bin.zip
    三、在Tinkerpop Server上安装JanusGraph依赖
    Note: 第四节有讲述如何配置grapeConfig.xml的内容, 如果下载失败, 可参考第四节.
    1. 进入apache-tinkerpop-gremlin-server-3.2.7下载依赖
    安装方式: bin/gremlin-server.sh -i org.janusgraph janusgraph-all $VERSION, 在这里下载JanusGraph的0.2.0版本.
    [[email protected] apache-tinkerpop-gremlin-server-3.2.7]# bin/gremlin-server.sh -i org.janusgraph janusgraph-all 0.2.0
     2. 重要提示(Important, 这个地方应该重点显示, 为了好看, 就这样了)
    下面引用JanusGraph官方文档中7.4.2. Using TinkerPop Gremlin Server with JanusGraph的一句话:
    The above command uses Groovy Grape and if it is not configured properly download errors may ensue. Please refer to this section of the TinkerPop documentation for more information around setting up ~/.groovy/grapeConfig.xml.
    相关地址http://docs.janusgraph.org/latest/server.html.
    大意就是:
    以上命令使用Groovy Grape,如果没有正确配置,可能会出现下载错误。有关设置的更多信息,请参阅TinkerPop文档的这一部分~/.groovy/grapeConfig.xml。
    上文提到的地址http://tinkerpop.apache.org/docs/3.2.6/reference/#gremlin-applications.
    四、为避免下载依赖失败, 修改Groovy Grape
    Note: Windows目录: C:\Users\[User_Name]\.groovy; Linux目录: ~/.groovy
    修改内容如下:
    1. 在上述目录中创建grapeConfig.xml文件, 或目录不存在, 先创建目录.
    2. 修改文件内容如下:
    <ivysettings>
     <settings defaultResolver="downloadGrapes"/>
     <property name="m2-pattern" value="${user.home}/.m2/repository/org/apache/tinkerpop/[module]/[revision]/[module]-[revision](-[classifier]).[ext]" />
     <property name="m2-pattern-ivy" value="${user.home}/.m2/repository/org/apache/tinkerpop/[module]/[revision]/[module]-[revision](-[classifier]).pom" />
     <caches>
     <cache name="nocache" useOrigin="true" />
     </caches>
     <resolvers>
     <chain name="downloadGrapes">
     <filesystem name="local-maven2" checkmodified="true" changingPattern=".*" changingMatcher="regexp" m2compatible="true" cache="nocache">
     <artifact pattern="${m2-pattern}"/>
     <ivy pattern="${m2-pattern-ivy}"/>
     </filesystem>
     <filesystem name="cachedGrapes">
     <ivy pattern="${user.home}/.groovy/grapes/[organisation]/[module]/ivy-[revision].xml"/>
     <artifact pattern="${user.home}/.groovy/grapes/[organisation]/[module]/[type]s/[artifact]-[revision].[ext]"/>
     </filesystem>
     <ibiblio name="ibiblio" m2compatible="true"/>
     <ibiblio name="local" root="file:${user.home}/.m2/repository/" m2compatible="true"/>
     <ibiblio name="oracle" root="http://download.oracle.com/maven" m2compatible="true"/>
     </chain>
     </resolvers>
    </ivysettings>
    3. 重新执行第三节的内容.
    五、参数设置(master节点操作)
    1. 在${TINKERPOP_HOME}/conf/目录创建janusgraph-hbase-server.properties, 添加内容如下:
    gremlin.graph=org.janusgraph.core.JanusGraphFactory
    storage.backend=hbase # 存储方式为Hbase.
    storage.hostname=localhost # Zookeeper地址, 需要根据实际地址修改, 多台用','间隔.
    # 以下信息可参考, 不影响下面步骤.
    cache.db-cache = true
    cache.db-cache-clean-wait = 20
    cache.db-cache-time = 180000
    cache.db-cache-size = 0.5 
    # 以下信息可参考, 用于配置ElasticSearch索引信息, 不影响下面步骤.
    index.search.backend=elasticsearch # 索引方式.
    index.search.hostname=localhost # ElasticSearch主机地址, 多台用','间隔.
    index.search.port=9200 # ElasticSearch通信端口.
    index.search.elasticsearch.client-only=false
    2. 在${TINKERPOP_HOME}/conf/目录创建janusgraph-gremlin-server.yaml, 添加内容如下:
    host: 0.0.0.0
    port: 8182
    scriptEvaluationTimeout: 300000
    channelizer: org.apache.tinkerpop.gremlin.server.channel.WebSocketChannelizer
    graphs: {
     graph: conf/janusgraph-hbase-server.properties
    }
    plugins:
     - janusgraph.imports
    scriptEngines: {
     gremlin-groovy: {
     imports: [java.lang.Math],
     staticImports: [java.lang.Math.PI],
     scripts: [scripts/empty-sample.groovy]}}
    serializers:
     - { className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry] }}
     - { className: org.apache.tinkerpop.gremlin.driver.ser.GryoLiteMessageSerializerV1d0, config: {ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry] }}
     - { className: org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0, config: { serializeResultToString: true }}
     - { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerGremlinV1d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistryV1d0] }}
     - { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerGremlinV2d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistry] }}
     - { className: org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerV1d0, config: { ioRegistries: [org.janusgraph.graphdb.tinkerpop.JanusGraphIoRegistryV1d0] }}
    processors:
     - { className: org.apache.tinkerpop.gremlin.server.op.session.SessionOpProcessor, config: { sessionTimeout: 28800000 }}
     - { className: org.apache.tinkerpop.gremlin.server.op.traversal.TraversalOpProcessor, config: { cacheExpirationTime: 600000, cacheMaxSize: 1000 }}
    metrics: {
     consoleReporter: {enabled: true, interval: 180000},
     csvReporter: {enabled: true, interval: 180000, fileName: /tmp/gremlin-server-metrics.csv},
     jmxReporter: {enabled: true},
     slf4jReporter: {enabled: true, interval: 180000},
     gangliaReporter: {enabled: false, interval: 180000, addressingMode: MULTICAST},
     graphiteReporter: {enabled: false, interval: 180000}}
    maxInitialLineLength: 4096
    maxHeaderSize: 8192
    maxChunkSize: 8192
    maxContentLength: 65536
    maxAccumulationBufferComponents: 1024
    resultIterationBatchSize: 64
    writeBufferLowWaterMark: 32768
    writeBufferHighWaterMark: 65536
    3. 在${TINKERPOP_HOME}/scripts/目录创建empty-sample.groovy,(此文件默认存在, 可直接使用.) 添加内容如下:
    def globals = [:]
    globals << [g : graph.traversal()]
    六、运行Tinkerpop Server
    [[email protected] apache-tinkerpop-gremlin-server-3.2.7]# bin/gremlin-server.sh conf/janusgraph-gremlin-server.yaml
    1. 当显示如下内容时, 说明启动成功:
    ......
    INFO] GremlinServer - Executing start up LifeCycleHook
    [INFO] Logger$info - Executed once at startup of Gremlin Server.
    [INFO] AbstractChannelizer - Configured application/vnd.gremlin-v1.0+gryo with org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0
    [WARN] AbstractChannelizer - The org.apache.tinkerpop.gremlin.driver.ser.GryoLiteMessageSerializerV1d0 serialization class is deprecated.
    [INFO] AbstractChannelizer - Configured application/vnd.gremlin-v1.0+gryo-lite with org.apache.tinkerpop.gremlin.driver.ser.GryoLiteMessageSerializerV1d0
    [INFO] AbstractChannelizer - Configured application/vnd.gremlin-v1.0+gryo-stringd with org.apache.tinkerpop.gremlin.driver.ser.GryoMessageSerializerV1d0
    [INFO] AbstractChannelizer - Configured application/vnd.gremlin-v1.0+json with org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerGremlinV1d0
    [INFO] AbstractChannelizer - Configured application/vnd.gremlin-v2.0+json with org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerGremlinV2d0
    [INFO] AbstractChannelizer - Configured application/json with org.apache.tinkerpop.gremlin.driver.ser.GraphSONMessageSerializerV1d0
    [INFO] GremlinServer$1 - Gremlin Server configured with worker thread pool of 1, gremlin pool of 32 and boss thread pool of 1.
    [INFO] GremlinServer$1 - Channel started at port 8182.
    2. 当提示以下内容时, 修改文件, 去除\t
    [ERROR] GremlinServer - Configuration file at conf/janusgraph-gremlin-server.yaml could not be found or parsed properly. [while scanning for the next token
    found character '\t(TAB)' that cannot start any token. (Do not use \t(TAB) for indentation)
     in 'reader', line 6, column 1:
     graph: conf/janusgraph-hbase-se ... 
     ^
    ]
    七、通过Tinkerpop Console测试
    1. 进入apache-tinkerpop-gremlin-console-3.2.7目录, 运行:
    [[email protected] apache-tinkerpop-gremlin-console-3.2.7]# ./bin/gremlin.sh
    Feb 08, 2018 4:57:21 PM java.util.prefs.FileSystemPreferences$1 run
    INFO: Created user preferences directory.
     
     \,,,/
     (o o)
    -----oOOo-(3)-oOOo-----
    plugin activated: tinkerpop.server
    plugin activated: tinkerpop.utilities
    plugin activated: tinkerpop.tinkergraph
    gremlin> :remote connect tinkerpop.server conf/remote.yaml
    ==>Configured localhost/127.0.0.1:8182
    gremlin> :> g.V().count()
    ==> 0
    运行无阻时, 说明部署完成. 可以继续往下进行.