Nebula Importer¶

Nebula Importer (Importer) is a standalone tool for importing CSV files into NebulaGraph. Importer reads local CSV files and writes their data into the NebulaGraph database.

Scenario¶

Importer is used to import the contents of local CSV files into NebulaGraph.

Advantage¶

Lightweight and fast: no complex environment is required, and data is imported quickly.

Flexible filtering: You can flexibly filter CSV data through configuration files.

Release note¶

Release

Prerequisites¶

Before using Nebula Importer, make sure:

The NebulaGraph service has been deployed. Three deployment modes are currently available:
- Deploy NebulaGraph with Docker Compose
- Install NebulaGraph with RPM or DEB package
- Install NebulaGraph by compiling the source code

The schema has been created in NebulaGraph, including the space, tags, and edge types. Alternatively, the schema can be created through the clientSettings.postStart.commands parameter in the configuration file.

A Golang environment has been deployed on the machine that runs Importer. For details, see Build Go environment.
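
As a rough illustration of the clientSettings.postStart.commands option mentioned in the schema prerequisite, the shell snippet below writes a YAML fragment that places schema statements under that parameter. The space, tag, and edge type names (demo_space, student, follow), the output file name poststart-fragment.yaml, and the afterPeriod value are assumptions chosen for the example. The fragment is not a complete configuration and would need to be merged into a full YAML file.

```shell
# Write an illustrative fragment (not a complete configuration) to a file.
# All schema names below (demo_space, student, follow) are hypothetical.
cat > poststart-fragment.yaml <<'EOF'
clientSettings:
  postStart:
    # nGQL statements executed after connecting and before data is inserted.
    commands: |
      CREATE SPACE IF NOT EXISTS demo_space(partition_num=5, replica_factor=1, vid_type=FIXED_STRING(10));
      USE demo_space;
      CREATE TAG IF NOT EXISTS student(name string, age int);
      CREATE EDGE IF NOT EXISTS follow(degree double);
    # Assumed wait time so the new schema can take effect before the import starts.
    afterPeriod: 8s
EOF
```

The quoted heredoc delimiter ('EOF') keeps the shell from expanding anything inside the fragment, so the statements are written to the file exactly as typed.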

Steps¶

Configure the YAML file and prepare the CSV files to be imported, then use the tool to write the data to NebulaGraph in batches.

Download binary package and run¶

Download the binary package directly and add execute permission to it.
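
The permission step might look like the sketch below. The helper name make_executable and the path in the usage comment are hypothetical; substitute the name of the package you actually downloaded.

```shell
# Hypothetical helper: mark a downloaded binary as executable and confirm it.
make_executable() {
    bin="$1"
    if [ ! -f "$bin" ]; then
        echo "file not found: $bin" >&2
        return 1
    fi
    chmod +x "$bin" && echo "executable: $bin"
}

# Usage (the path is an assumption):
#   make_executable ./path/to/downloaded-binary
```

Checking that the file exists first gives a clearer message than the raw chmod error when the download path is mistyped.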

Start the service.

$ ./<binary_package_name> --config <yaml_config_file_path>

Source code compile and run¶

Clone repository.
```
$ git clone -b v2.6.0 https://github.com/vesoft-inc/nebula-importer.git
```
Note

Use the correct branch. NebulaGraph 1.x and 2.x have different RPC protocols, so:
- The Nebula Importer v1 branch can only connect to NebulaGraph 1.x.
- The Nebula Importer master branch and v2 branch can connect to NebulaGraph 2.x.
Access the directory nebula-importer.
```
$ cd nebula-importer
```
Compile the source code.
```
$ make build
```
Start the service.
```
$ ./nebula-importer --config <yaml_config_file_path>
```
Note

For details about the YAML configuration file, see Configuration File Description at the end of this topic.

No network compilation mode¶

If the server cannot connect to the Internet, it is recommended to download the source code and its dependency packages on a machine that can connect to the Internet, and then upload them to the server for compilation. The steps are as follows:

Clone repository.

$ git clone -b v2.6.0 https://github.com/vesoft-inc/nebula-importer.git

Use the following command to download and package the dependent source code.

$ cd nebula-importer
$ go mod vendor
$ cd .. && tar -zcvf nebula-importer.tar.gz nebula-importer

Upload the compressed package to a server that cannot be connected to the Internet.

Unzip and compile.

$ tar -zxvf nebula-importer.tar.gz 
$ cd nebula-importer
$ go build -mod vendor cmd/importer.go

Run in Docker mode¶

Instead of installing the Go environment locally, you can use Docker to pull the Nebula Importer image and mount the local configuration file and CSV data files into the container. The command is as follows:

$ docker run --rm -ti \
    --network=host \
    -v <config_file>:<config_file> \
    -v <csv_data_dir>:<csv_data_dir> \
    vesoft/nebula-importer:<version> \
    --config <config_file>

<config_file>: The absolute path to the local YAML configuration file.
<csv_data_dir>: The absolute path to the local directory that holds the CSV data files.
<version>: The image version. For NebulaGraph 2.x, fill in v2.

Note

A relative path is recommended. If you use a local absolute path, check that it maps to the corresponding path inside the Docker container.
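
To illustrate the path mapping, the sketch below assembles the docker run command with each host path mounted at the identical path inside the container, so that paths written in the configuration file resolve the same way in both places. The helper name importer_docker_cmd and the example paths are hypothetical, and the function only prints the command rather than running it.

```shell
# Hypothetical helper: print a docker run command that mounts the config file
# and the CSV directory at the same absolute paths inside the container.
importer_docker_cmd() {
    config_file="$1"   # absolute path to the YAML configuration file
    csv_data_dir="$2"  # absolute path to the directory holding the CSV files
    version="$3"       # image tag, for example v2
    printf 'docker run --rm -ti --network=host -v %s:%s -v %s:%s vesoft/nebula-importer:%s --config %s\n' \
        "$config_file" "$config_file" \
        "$csv_data_dir" "$csv_data_dir" \
        "$version" "$config_file"
}

# Example with assumed paths:
importer_docker_cmd /home/user/importer/config.yaml /home/user/importer/data v2
```

The printed command can be reviewed before it is pasted into a terminal. The sketch does not quote the paths in its output, so it assumes paths without spaces.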

Configuration File Description¶

Nebula Importer uses a YAML configuration file (nebula-importer/examples/v2/example.yaml) to describe the files to be imported, the NebulaGraph server, and other settings. You can refer to the example configuration files: Configuration without Header/Configuration with Header. This section describes the fields in the configuration file by category.

Note

If you downloaded a binary package, create the configuration file manually.