You are viewing the documentation for Pilosa master. View the latest documentation for Pilosa v0.8.8 .

Input Definition

This document builds on the data import concepts introduced in Getting Started.
Here we will demonstrate creating the index’s schema and data definition. Then using this definition to import JSON data.

Create the Schema

Input definitions allow users to define a schema based on their data and to provide data to Pilosa in a more standard format like JSON. Once an input definition is created, we can send data to Pilosa as JSON, and as long as the data adheres to the definition, Pilosa will internally perform all of the appropriate mutations.

Before creating a schema, let’s create the repository index first:

curl localhost:10101/index/repository -X POST

The sample input definition schema for the “Star Trace” project is at Pilosa Getting Started repository in the input_definition.json file. Download it using:

curl -OL https://github.com/pilosa/getting-started/raw/master/input_definition.json

Run the following to create the input definition:

curl localhost:10101/index/repository/input-definition/stargazer -d @input_definition.json 

Instead of creating a stargazer frame and a language frame individually like in Getting Started, we can create multiple frames in one input definition. We can also set repo_id for multiple frames at the same time by providing field actions. There are three options for valueDestination:

Import Data

The sample data for the input definition we created above is in the json_input.json file at Pilosa Getting Started repository. Download it using:

curl -OL https://github.com/pilosa/getting-started/raw/master/json_input.json

Then run the following to import it:

curl localhost:10101/index/repository/input/stargazer -d @json_input.json 

As defined in the input definition, field name language_id maps language to a corresponding id defined in valueMap and sets the appropriate bit in the language frame. The value corresponding to field name stargazer_id is added to the stargazer frame as rowID. The data input above is equivalent to the following SetBit() operations:

curl localhost:10101/index/repository/query \
     -X POST \
     -d 'SetBit(frame="stargazer", columnID=91720568, rowID=513114)
         SetBit(frame="stargazer", columnID=91720568, rowID=513114, timestamp="2017-05-18T20:40")
         SetBit(frame="language", columnID=91720568, rowID=5)
         SetBit(frame="language", columnID=95122322, rowID=17)
     '