Ruby
Table of Contents
Introduction
This document presents example Ruby code that excercises the Thrift API. To quickly get Hypertable up and running on a single machine so that you can try out these examples, see Hypertable Standalone Installation. The source code for all of the examples in this document, along with the build and run scripts, can be found in hypertable_api_example.tgz.
Environment setup and running
To execute our ruby script, we need to tell the ruby interpreter where to find the Hypertable thrift client library scripts. To do that, we pass a -I argument to the ruby interpreter. The script also needs to know where to find the thrift_client.rb file and for that it consults the HYPERTABLE_HOME environment variable. The following bash script illustrates how to setup the environment and run a Hypertable ruby thrift client script.
HYPERTABLE_HOME=/opt/hypertable/current
export HYPERTABLE_HOME
ruby -I ${HYPERTABLE_HOME}/lib/rb hypertable_api_test.rb
Program boilerplate
The following statements are required at the top of the script for the code examples in this document.
require 'rubygems' require ENV['HYPERTABLE_HOME'] + '/lib/rb/hypertable/thrift_client' require 'time' include Hypertable::ThriftGen
Creating a thrift client
All of the examples in this document reference a pointer to a Thrift client object. The following code snippets illustrate how to create a Thrift client object connected to a ThriftBroker listening on the default port (15867) on localhost. A ThriftClient object can be allocated directly, or via the with_thrift_client method of the Hypertable module. To change the ThriftBroker location, just change "localhost" to the domain name of the machine on which the ThriftBroker is running.
begin
client = Hypertable::ThriftClient.new("localhost", 15867)
rescue TException => e
puts e.message
exit 1
end
# alternatively ...
Hypertable.with_thrift_client("localhost", 15867) do |client|
...
end
Basics
The following code snippet illustrates the basics of working with namespaces and tables. The APIs introduced include namespace_exists, namespace_create, namespace_open, namespace_get_listing, namespace_close, table_drop, and table_create.
begin
if !client.namespace_exists("test")
client.namespace_create("test")
end
ns = client.namespace_open("test")
if_exists = true
client.table_drop(ns, "Fruits", if_exists)
cf_specs = Hash.new
cf = ColumnFamilySpec.new
cf.name = "genus"
cf_specs["genus"] = cf
cf = ColumnFamilySpec.new
cf.name = "description"
cf_specs["description"] = cf
cf = ColumnFamilySpec.new
cf.name = "tag"
cf_specs["tag"] = cf
schema = Schema.new
schema.column_families = cf_specs
client.table_create(ns, "Fruits", schema)
client.namespace_create("/test/sub")
listing = client.namespace_get_listing(ns)
listing.each do |entry|
if entry.is_namespace
puts "%s\t(dir)" % entry.name
else
puts entry.name
end
end
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
Fruits sub (dir)
Convenience APIs
set_cells
The following code snippet illustrates how to create Cell objects and insert them with the set_cells API. It assumes that the Fruits table in the test namespace has been created as illustrated in the Basics example.
begin
ns = client.namespace_open("test")
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "apple"
cell.key.column_family = "genus"
cell.value = "Malus"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "apple"
cell.key.column_family = "description"
cell.value = "The apple is the pomaceous fruit of the apple tree."
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "apple"
cell.key.column_family = "tag"
cell.key.column_qualifier = "crunchy"
cells.push(cell)
client.set_cells(ns, "Fruits", cells)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
get_cells
The following code snippet illustrates how to fetch cells with the get_cells API. It assumes that the Fruits table in the test namespace has been created as illustrated in the Basics example.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
ss.columns = Array[ "description" ]
cells = client.get_cells(ns, "Fruits", ss)
cells.each { |cell| puts cell }
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=apple column_family=description column_qualifier= flag=255} value=The apple is the pomaceous fruit of the apple tree.}
set_cells_as_arrays
The following code snippet illustrates how to create CellAsArray objects and insert them with the set_cells_as_arrays API. It assumes that the Fruits table in the test namespace has been created as illustrated in the Basics example.
begin
ns = client.namespace_open("test")
cells_as_arrays = Array.new
cell_as_array = Array["orange", "genus", "", "Citrus"]
cells_as_arrays.push(cell_as_array)
cell_as_array = Array["orange", "description", "", "The orange (specifically," +
"the sweet orange) is the fruit of the citrus species" +
"Citrus × sinensis in the family Rutaceae.""Citrus"]
cells_as_arrays.push(cell_as_array)
cell_as_array = Array["orange", "tag", "juicy", ""]
cells_as_arrays.push(cell_as_array)
client.set_cells_as_arrays(ns, "Fruits", cells_as_arrays)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
get_cells_as_arrays
The following code snippet illustrates how to fetch cells with the get_cells_as_arrays API. It assumes that the Fruits table in the test namespace has been created as illustrated in the Basics example and makes use of the print_cell_as_array function defined in Appendix - helper functions.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
ss.columns = Array[ "description" ]
cells_as_arrays = client.get_cells_as_arrays(ns, "Fruits", ss)
cells_as_arrays.each { |cell_as_array| print_cell_as_array(cell_as_array) }
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{CellAsArray key={Key row=apple column_family=description column_qualifier=} value=The apple is the pomaceous fruit of the apple tree.}
{CellAsArray key={Key row=orange column_family=description column_qualifier=} value=The orange (specifically,the sweet orange) is the fruit of the citrus speciesCitrus × sinensis in the family Rutaceae.Citrus}
Creating a table
The following code snippet illustrates how to create a table with the table_create API. It introduces the API classes Schema, AccessGroupSpec, AccessGroupOptions, ColumnFamilySpec, and ColumnFamilyOptions.
begin
ns = client.namespace_open("test")
schema = Schema.new
schema.access_groups = Hash.new
schema.column_families = Hash.new
# Set table defaults
schema.access_group_defaults = AccessGroupOptions.new
schema.access_group_defaults.blocksize = 65536
schema.column_family_defaults = ColumnFamilyOptions.new
schema.column_family_defaults.max_versions = 1
# Access group "ag_normal"
ag_spec = AccessGroupSpec.new
ag_spec.defaults = ColumnFamilyOptions.new
ag_spec.defaults.max_versions = 2
ag_spec.name = "ag_normal"
schema.access_groups["ag_normal"] = ag_spec
# Column "a"
cf_spec = ColumnFamilySpec.new
cf_spec.name = "a"
cf_spec.access_group = "ag_normal"
cf_spec.value_index = true
cf_spec.qualifier_index = true
schema.column_families["a"] = cf_spec
# Column "b"
cf_spec = ColumnFamilySpec.new
cf_spec.options = ColumnFamilyOptions.new
cf_spec.options.max_versions = 3
cf_spec.name = "b"
cf_spec.access_group = "ag_normal"
schema.column_families["b"] = cf_spec
# Access group "ag_fast"
ag_spec = AccessGroupSpec.new
ag_spec.options = AccessGroupOptions.new
ag_spec.options.in_memory = true
ag_spec.options.blocksize = 131072
ag_spec.name = "ag_fast"
schema.access_groups["ag_fast"] = ag_spec
# Column "c"
cf_spec = ColumnFamilySpec.new
cf_spec.name = "c"
cf_spec.access_group = "ag_fast"
schema.column_families["c"] = cf_spec
# Access group "ag_secure"
ag_spec = AccessGroupSpec.new
ag_spec.options = AccessGroupOptions.new
ag_spec.options.replication = 5
ag_spec.name = "ag_secure"
schema.access_groups["ag_secure"] = ag_spec
# Column "d"
cf_spec = ColumnFamilySpec.new
cf_spec.name = "d"
cf_spec.access_group = "ag_secure"
schema.column_families["d"] = cf_spec
# Access group "ag_counter"
ag_spec = AccessGroupSpec.new
ag_spec.defaults = ColumnFamilyOptions.new
ag_spec.defaults.counter = true
ag_spec.defaults.max_versions = 0
ag_spec.name = "ag_counter"
schema.access_groups["ag_counter"] = ag_spec
# Column "e"
cf_spec = ColumnFamilySpec.new
cf_spec.name = "e"
cf_spec.access_group = "ag_counter"
schema.column_families["e"] = cf_spec
# Column "f"
cf_spec = ColumnFamilySpec.new
cf_spec.options = ColumnFamilyOptions.new
cf_spec.options.counter = false
cf_spec.name = "f"
cf_spec.access_group = "ag_counter"
schema.column_families["f"] = cf_spec
client.table_create(ns, "TestTable", schema)
result = client.hql_query(ns, "SHOW CREATE TABLE TestTable")
if (!result.results.empty?)
puts result.results[0]
end
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
CREATE TABLE TestTable ( d MAX_VERSIONS 1, a MAX_VERSIONS 2, INDEX a, QUALIFIER INDEX a, b MAX_VERSIONS 3, c MAX_VERSIONS 1, e MAX_VERSIONS 0 COUNTER true, f MAX_VERSIONS 0 COUNTER false, ACCESS GROUP default () BLOCKSIZE 65536, ACCESS GROUP 'ag_secure' (d) REPLICATION 5 BLOCKSIZE 65536, ACCESS GROUP 'ag_normal' (a, b) BLOCKSIZE 65536 MAX_VERSIONS 2, ACCESS GROUP 'ag_fast' (c) BLOCKSIZE 131072 IN_MEMORY true, ACCESS GROUP 'ag_counter' (e, f) BLOCKSIZE 65536 MAX_VERSIONS 0 COUNTER true ) BLOCKSIZE 65536 MAX_VERSIONS 1;
Altering a table
The following code snippet illustrates how to alter a table with the table_alter API. It assumes that the table TestTable has been created as in the Creating a table example.
begin
ns = client.namespace_open("test")
schema = client.get_schema(ns, "TestTable")
# Rename column "b" to "z"
cf_spec = schema.column_families["b"]
schema.column_families.delete("b")
cf_spec.name = "z"
schema.column_families["z"] = cf_spec
# Add column "g"
cf_spec = ColumnFamilySpec.new
cf_spec.name = "g"
cf_spec.access_group = "ag_counter"
schema.column_families["g"] = cf_spec
client.table_alter(ns, "TestTable", schema)
result = client.hql_query(ns, "SHOW CREATE TABLE TestTable")
if (!result.results.empty?)
puts result.results[0]
end
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
CREATE TABLE TestTable ( d MAX_VERSIONS 1, a MAX_VERSIONS 2, INDEX a, QUALIFIER INDEX a, z MAX_VERSIONS 3, c MAX_VERSIONS 1, e MAX_VERSIONS 0 COUNTER true, f MAX_VERSIONS 0 COUNTER false, g MAX_VERSIONS 0 COUNTER true, ACCESS GROUP default () BLOCKSIZE 65536, ACCESS GROUP 'ag_secure' (d) REPLICATION 5 BLOCKSIZE 65536, ACCESS GROUP 'ag_normal' (a, z) BLOCKSIZE 65536 MAX_VERSIONS 2, ACCESS GROUP 'ag_fast' (c) BLOCKSIZE 131072 IN_MEMORY true, ACCESS GROUP 'ag_counter' (e, f, g) BLOCKSIZE 65536 MAX_VERSIONS 0 COUNTER true ) BLOCKSIZE 65536 MAX_VERSIONS 1;
Mutator
The code snippet below illustrates how to insert cells into a table using a mutator. The APIs introduced include mutator_open, mutator_set_cells, mutator_flush, and mutator_close.
begin
ns = client.namespace_open("test")
mutator = client.mutator_open(ns, "Fruits", 0, 0)
# Auto-assigned timestamps
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "lemon"
cell.key.column_family = "genus"
cell.value = "Citrus"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "lemon"
cell.key.column_family = "tag"
cell.key.column_qualifier = "bitter"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "lemon"
cell.key.column_family = "description"
cell.value = "The lemon (Citrus × limon) is a small evergreen tree native to Asia."
cells.push(cell)
client.mutator_set_cells(mutator, cells)
client.mutator_flush(mutator)
# Explicitly-supplied timestamps
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "mango"
cell.key.column_family = "genus"
cell.key.timestamp = Time.parse("2014-06-06 16:27:15").to_i * 1000000000
cell.value = "Mangifera"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "mango"
cell.key.column_family = "tag"
cell.key.column_qualifier = "sweet"
cell.key.timestamp = Time.parse("2014-06-06 16:27:15").to_i * 1000000000
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "mango"
cell.key.column_family = "description"
cell.key.timestamp = Time.parse("2014-06-06 16:27:15").to_i * 1000000000
cell.value = "Mango is one of the delicious seasonal fruits grown in the tropics."
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "mango"
cell.key.column_family = "description"
cell.key.timestamp = Time.parse("2014-06-06 16:27:16").to_i * 1000000000
cell.value = "The mango is a juicy stone fruit belonging to the genus " +
"Mangifera, consisting of numerous tropical fruiting trees, that are" +
" cultivated mostly for edible fruits."
cells.push(cell)
client.mutator_set_cells(mutator, cells)
client.mutator_flush(mutator)
# Delete cells
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "apple"
cell.key.flag = Hypertable::ThriftGen::KeyFlag::DELETE_ROW
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "mango"
cell.key.column_family = "description"
cell.key.timestamp = Time.parse("2014-06-06 16:27:15").to_i * 1000000000
cell.key.flag = Hypertable::ThriftGen::KeyFlag::DELETE_CELL
cells.push(cell)
client.mutator_set_cells(mutator, cells)
client.mutator_flush(mutator)
client.mutator_close(mutator)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
Scanner
The following examples illustrate how to query a table using a scanner. The APIs introduced include the scanner_open, scanner_get_cells, and scanner_close functions, and the ScanSpec and RowInterval classes.
Full table scan
The following code illustrates how to do a full table scan using the scanner APIs.
begin
ns = client.namespace_open("test")
scanner = client.scanner_open(ns, "Fruits", ScanSpec.new)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=lemon column_family=description column_qualifier= flag=255} value=The lemon (Citrus × limon) is a small evergreen tree native to Asia.}
{Cell key={Key row=lemon column_family=genus column_qualifier= flag=255} value=Citrus}
{Cell key={Key row=lemon column_family=tag column_qualifier=bitter flag=255} value=}
{Cell key={Key row=mango column_family=description column_qualifier= flag=255} value=The mango is a juicy stone fruit belonging to the genus Mangifera, consisting of numerous tropical fruiting trees, that are cultivated mostly for edible fruits.}
{Cell key={Key row=mango column_family=genus column_qualifier= flag=255} value=Mangifera}
{Cell key={Key row=mango column_family=tag column_qualifier=sweet flag=255} value=}
{Cell key={Key row=orange column_family=description column_qualifier= flag=255} value=The orange (specifically,the sweet orange) is the fruit of the citrus speciesCitrus × sinensis in the family Rutaceae.Citrus}
{Cell key={Key row=orange column_family=genus column_qualifier= flag=255} value=Citrus}
{Cell key={Key row=orange column_family=tag column_qualifier=juicy flag=255} value=}
Restricted scan with ScanSpec
The following code illustrates how to do a table scan using a ScanSpec.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
# Return row range [lemon..orange)
ri = RowInterval.new
ri.start_row = "lemon"
ri.start_inclusive = true
ri.end_row = "orange"
ri.end_inclusive = false
ss.row_intervals = Array[ ri ]
# Return columns "genus", "tag:bitter", "tag:sweet"
ss.columns = Array["genus", "tag:bitter", "tag:sweet"]
# Return only most recent version of each cell
ss.versions = 1
scanner = client.scanner_open(ns, "Fruits", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=lemon column_family=genus column_qualifier= flag=255} value=Citrus}
{Cell key={Key row=lemon column_family=tag column_qualifier=bitter flag=255} value=}
{Cell key={Key row=mango column_family=genus column_qualifier= flag=255} value=Mangifera}
{Cell key={Key row=mango column_family=tag column_qualifier=sweet flag=255} value=}
HQL
hql_query
The following code illustrates how to issue HQL commands with the hql_query API. It also introduces the HqlResult class.
begin
ns = client.namespace_open("test")
result = client.hql_query(ns, "GET LISTING")
result.results.each { |str| puts str }
result = client.hql_query(ns, "SELECT * FROM Fruits WHERE ROW = 'mango'")
result.cells.each { |cell| puts cell }
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
Fruits
TestTable
^TestTable
^^TestTable
sub (namespace)
{Cell key={Key row=mango column_family=description column_qualifier= flag=255} value=The mango is a juicy stone fruit belonging to the genus Mangifera, consisting of numerous tropical fruiting trees, that are cultivated mostly for edible fruits.}
{Cell key={Key row=mango column_family=genus column_qualifier= flag=255} value=Mangifera}
{Cell key={Key row=mango column_family=tag column_qualifier=sweet flag=255} value=}
hql_query_as_arrays
The following code illustrates how to issue an HQL query with the hql_query_as_arrays API. It introduces the HqlResultAsArrays class and makes use of the function print_cell_as_array defined in Appendix - helper functions.
begin
ns = client.namespace_open("test")
result_as_arrays =
client.hql_query_as_arrays(ns, "SELECT * FROM Fruits WHERE ROW = 'lemon'")
result_as_arrays.cells.each { |cell_as_array| print_cell_as_array(cell_as_array) }
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{CellAsArray key={Key row=lemon column_family=description column_qualifier=} value=The lemon (Citrus × limon) is a small evergreen tree native to Asia.}
{CellAsArray key={Key row=lemon column_family=genus column_qualifier=} value=Citrus}
{CellAsArray key={Key row=lemon column_family=tag column_qualifier=bitter} value=}
hql_exec (mutator)
The following code illustrates how to issue an HQL command with the hql_exec API that returns a mutator.
begin
ns = client.namespace_open("test")
result = client.hql_exec(ns, "INSERT INTO Fruits VALUES ('strawberry', " +
"'genus', 'Fragaria'), ('strawberry', 'tag:fibrous', '')," +
" ('strawberry', 'description', 'The garden strawberry is" +
" a widely grown hybrid species of the genus Fragaria')",
true, false)
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "pineapple"
cell.key.column_family = "genus"
cell.value = "Ananas"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "pineapple"
cell.key.column_family = "tag"
cell.key.column_qualifier = "acidic"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "pineapple"
cell.key.column_family = "description"
cell.value = "The pineapple (Ananas comosus) is a tropical plant with " +
"edible multiple fruit consisting of coalesced berries."
cells.push(cell)
client.mutator_set_cells(result.mutator, cells)
client.mutator_flush(result.mutator)
client.mutator_close(result.mutator)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
hql_exec (scanner}
The following code illustrates how to issue an HQL command with the hql_exec API that returns a scanner.
begin
ns = client.namespace_open("test")
result = client.hql_exec(ns, "SELECT * FROM Fruits", false, true)
cells = client.scanner_get_cells(result.scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(result.scanner)
end
client.scanner_close(result.scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=lemon column_family=description column_qualifier= flag=255} value=The lemon (Citrus × limon) is a small evergreen tree native to Asia.}
{Cell key={Key row=lemon column_family=genus column_qualifier= flag=255} value=Citrus}
{Cell key={Key row=lemon column_family=tag column_qualifier=bitter flag=255} value=}
{Cell key={Key row=mango column_family=description column_qualifier= flag=255} value=The mango is a juicy stone fruit belonging to the genus Mangifera, consisting of numerous tropical fruiting trees, that are cultivated mostly for edible fruits.}
{Cell key={Key row=mango column_family=genus column_qualifier= flag=255} value=Mangifera}
{Cell key={Key row=mango column_family=tag column_qualifier=sweet flag=255} value=}
{Cell key={Key row=orange column_family=description column_qualifier= flag=255} value=The orange (specifically,the sweet orange) is the fruit of the citrus speciesCitrus × sinensis in the family Rutaceae.Citrus}
{Cell key={Key row=orange column_family=genus column_qualifier= flag=255} value=Citrus}
{Cell key={Key row=orange column_family=tag column_qualifier=juicy flag=255} value=}
{Cell key={Key row=pineapple column_family=description column_qualifier= flag=255} value=The pineapple (Ananas comosus) is a tropical plant with edible multiple fruit consisting of coalesced berries.}
{Cell key={Key row=pineapple column_family=genus column_qualifier= flag=255} value=Ananas}
{Cell key={Key row=pineapple column_family=tag column_qualifier=acidic flag=255} value=}
{Cell key={Key row=strawberry column_family=description column_qualifier= flag=255} value=The garden strawberry is a widely grown hybrid species of the genus Fragaria}
{Cell key={Key row=strawberry column_family=genus column_qualifier= flag=255} value=Fragaria}
{Cell key={Key row=strawberry column_family=tag column_qualifier=fibrous flag=255} value=}
Secondary indices
This section describes how to query tables using secondary indices. APIs introduced include the ColumnPredicate class and the column_predicates and the and_column_predicates members of the ScanSpec class. The examples assume that the table products has been created and loaded with the following HQL commands.
CREATE TABLE products ( title, section, info, category, INDEX section, INDEX info, QUALIFIER INDEX info, QUALIFIER INDEX category ); LOAD DATA INFILE 'indices_test_products.tsv' INTO TABLE products;
Value index (exact match)
The following HQL query which leverages the value index of the section column:
SELECT title FROM products WHERE section = 'books';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "section"
column_predicate.operation = ColumnPredicateOperation::EXACT_MATCH
column_predicate.value = "books"
ss.column_predicates = Array[ column_predicate ]
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=0307743659 column_family=title column_qualifier= flag=255} value=The Shining Mass Market Paperback}
{Cell key={Key row=0321321928 column_family=title column_qualifier= flag=255} value=C++ Common Knowledge: Essential Intermediate Programming [Paperback]}
{Cell key={Key row=0321776402 column_family=title column_qualifier= flag=255} value=C++ Primer Plus (6th Edition) (Developer's Library)}
Value index (exact match with qualifier)
The following HQL query which leverages the value index of the info column:
SELECT title FROM products WHERE info:actor = 'Jack Nicholson';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "actor"
column_predicate.operation = ColumnPredicateOperation::EXACT_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "Jack Nicholson"
ss.column_predicates = Array[ column_predicate ]
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=B00002VWE0 column_family=title column_qualifier= flag=255} value=Five Easy Pieces (1970)}
{Cell key={Key row=B002VWNIDG column_family=title column_qualifier= flag=255} value=The Shining (1980)}
Value index (prefix match)
The following HQL query which leverages the value index of the info column:
SELECT title, info:publisher FROM products WHERE info:publisher =^ 'Addison-Wesley';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "publisher"
column_predicate.operation = ColumnPredicateOperation::PREFIX_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "Addison-Wesley"
ss.column_predicates = Array[ column_predicate ]
ss.columns = Array[ "title", "info:publisher" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=0321321928 column_family=title column_qualifier= flag=255} value=C++ Common Knowledge: Essential Intermediate Programming [Paperback]}
{Cell key={Key row=0321321928 column_family=info column_qualifier=publisher flag=255} value=Addison-Wesley Professional; 1 edition (March 10, 2005)}
{Cell key={Key row=0321776402 column_family=title column_qualifier= flag=255} value=C++ Primer Plus (6th Edition) (Developer's Library)}
{Cell key={Key row=0321776402 column_family=info column_qualifier=publisher flag=255} value=Addison-Wesley Professional; 6 edition (October 28, 2011)}
Value index (regex match)
The following HQL query which leverages the value index of the info column:
SELECT title, info:publisher FROM products WHERE info:publisher =~ /^Addison-Wesley/;
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "publisher"
column_predicate.operation = ColumnPredicateOperation::REGEX_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "^Addison-Wesley"
ss.column_predicates = Array[ column_predicate ]
ss.columns = Array[ "title", "info:publisher" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=0321321928 column_family=title column_qualifier= flag=255} value=C++ Common Knowledge: Essential Intermediate Programming [Paperback]}
{Cell key={Key row=0321321928 column_family=info column_qualifier=publisher flag=255} value=Addison-Wesley Professional; 1 edition (March 10, 2005)}
{Cell key={Key row=0321776402 column_family=title column_qualifier= flag=255} value=C++ Primer Plus (6th Edition) (Developer's Library)}
{Cell key={Key row=0321776402 column_family=info column_qualifier=publisher flag=255} value=Addison-Wesley Professional; 6 edition (October 28, 2011)}
Qualifier index (exists)
The following HQL query which leverages the qualifier index of the info column:
SELECT title FROM products WHERE Exists(info:studio);
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "studio"
column_predicate.operation = ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
ss.column_predicates = Array[ column_predicate ]
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=B00002VWE0 column_family=title column_qualifier= flag=255} value=Five Easy Pieces (1970)}
{Cell key={Key row=B000Q66J1M column_family=title column_qualifier= flag=255} value=2001: A Space Odyssey [Blu-ray]}
{Cell key={Key row=B002VWNIDG column_family=title column_qualifier= flag=255} value=The Shining (1980)}
Qualifier index (exists regex prefix match)
The following HQL query which leverages the qualifier index of the category column:
SELECT title FROM products WHERE Exists(category:/^\/Movies/);
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "category"
column_predicate.column_qualifier = "^/Movies"
column_predicate.operation = ColumnPredicateOperation::QUALIFIER_REGEX_MATCH
ss.column_predicates = Array[ column_predicate ]
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=B00002VWE0 column_family=title column_qualifier= flag=255} value=Five Easy Pieces (1970)}
{Cell key={Key row=B000Q66J1M column_family=title column_qualifier= flag=255} value=2001: A Space Odyssey [Blu-ray]}
{Cell key={Key row=B002VWNIDG column_family=title column_qualifier= flag=255} value=The Shining (1980)}
Value index (OR query)
The following HQL query performs a boolean OR combination of two lookups against the value index of the info column:
SELECT title FROM products WHERE info:author =~ /^Stephen P/ OR info:publisher =^ 'Anchor';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
ss.column_predicates = Array.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "author"
column_predicate.operation = ColumnPredicateOperation::REGEX_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "^Stephen P"
ss.column_predicates.push(column_predicate)
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "publisher"
column_predicate.operation = ColumnPredicateOperation::PREFIX_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "Anchor"
ss.column_predicates.push(column_predicate)
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=0307743659 column_family=title column_qualifier= flag=255} value=The Shining Mass Market Paperback}
{Cell key={Key row=0321776402 column_family=title column_qualifier= flag=255} value=C++ Primer Plus (6th Edition) (Developer's Library)}
Value index (AND query)
The following HQL query performs a boolean AND combination of two lookups against the value index of the info column:
SELECT title FROM products WHERE info:author =~ /^Stephen [PK]/ AND info:publisher =^ 'Anchor';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
ss.column_predicates = Array.new
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "author"
column_predicate.operation = ColumnPredicateOperation::REGEX_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "^Stephen [PK]"
ss.column_predicates.push(column_predicate)
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "publisher"
column_predicate.operation = ColumnPredicateOperation::PREFIX_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "Anchor"
ss.column_predicates.push(column_predicate)
ss.and_column_predicates = true
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=0307743659 column_family=title column_qualifier= flag=255} value=The Shining Mass Market Paperback}
Value index (AND row interval)
The following HQL query performs a boolean AND combination of a lookup against the value index of the info column and a ROW interval:
SELECT title FROM products WHERE ROW > 'B00002VWE0' AND info:actor = 'Jack Nicholson';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
# ROW > 'B00002VWE0'
ri = RowInterval.new
ri.start_row = "B00002VWE0"
ri.start_inclusive = false
ss.row_intervals = Array[ ri ]
# info:actor = 'Jack Nicholson'
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "actor"
column_predicate.operation = ColumnPredicateOperation::EXACT_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "Jack Nicholson"
ss.column_predicates = Array[ column_predicate ]
ss.and_column_predicates = true
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=B002VWNIDG column_family=title column_qualifier= flag=255} value=The Shining (1980)}
Value index (AND row prefix)
The following HQL query performs a boolean AND combination of a lookup against the value index of the info column and a ROW prefix interval:
SELECT title FROM products WHERE ROW =^ 'B' AND info:actor = 'Jack Nicholson';
can be issued programmatically with the following code snippet.
begin
ns = client.namespace_open("test")
ss = ScanSpec.new
# ROW =^ 'B'
ri = RowInterval.new
ri.start_row = "B"
ri.start_inclusive = true
ri.end_row = "C"
ri.end_inclusive = false
ss.row_intervals = Array[ ri ]
# info:actor = 'Jack Nicholson'
column_predicate = ColumnPredicate.new
column_predicate.column_family = "info"
column_predicate.column_qualifier = "actor"
column_predicate.operation = ColumnPredicateOperation::EXACT_MATCH |
ColumnPredicateOperation::QUALIFIER_EXACT_MATCH
column_predicate.value = "Jack Nicholson"
ss.column_predicates = Array[ column_predicate ]
ss.and_column_predicates = true
ss.columns = Array[ "title" ]
scanner = client.scanner_open(ns, "products", ss)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=B00002VWE0 column_family=title column_qualifier= flag=255} value=Five Easy Pieces (1970)}
{Cell key={Key row=B002VWNIDG column_family=title column_qualifier= flag=255} value=The Shining (1980)}
Asynchronous APIs
This section describes how to use the asynchronous APIs. The examples assume that tables Profile and Session are created and loaded with the following HQL commands.
CREATE TABLE Profile (info, last_access MAX_VERSIONS 1);
CREATE TABLE Session (user_id, page_hit);
INSERT INTO Profile
VALUES ('1', 'info:name', 'Joe'), ('2', 'info:name', 'Sue');
Async mutator
The code snippet below illustrates how to insert cells into multiple tables simultaneously using an asynchronous mutator. The APIs introduced include the future_open, future_get_result, future_close, async_mutator_open, async_mutator_set_cells, async_mutator_flush, and async_mutator_close functions and the Result class.
begin
ns = client.namespace_open("test")
ff = client.future_open(0)
profile_mutator = client.async_mutator_open(ns, "Profile", ff, 0)
session_mutator = client.async_mutator_open(ns, "Session", ff, 0)
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "1"
cell.key.column_family = "last_access"
cell.value = "2014-06-13 16:06:09"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "2"
cell.key.column_family = "last_access"
cell.value = "2014-06-13 16:06:10"
cells.push(cell)
client.async_mutator_set_cells(profile_mutator, cells)
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "0001-200238"
cell.key.column_family = "user_id"
cell.key.column_qualifier = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "0001-200238"
cell.key.column_family = "page_hit"
cell.value = "/index.html"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "0002-383049"
cell.key.column_family = "user_id"
cell.key.column_qualifier = "2"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "0002-383049"
cell.key.column_family = "page_hit"
cell.value = "/foo/bar.html"
cells.push(cell)
client.async_mutator_set_cells(session_mutator, cells)
client.async_mutator_flush(profile_mutator)
client.async_mutator_flush(session_mutator)
result_count = 0
while true do
result = client.future_get_result(ff, 0)
if result.is_empty
break
end
result_count += 1
if result.is_error
puts "Async mutator error: " + result.error_msg
exit 1
end
if result.id == profile_mutator
puts "Result is from Profile mutation"
elsif result.id == session_mutator
puts "Result is from Session mutation"
end
end
puts "result count = %d" % result_count
client.async_mutator_close(profile_mutator)
client.async_mutator_close(session_mutator)
client.future_close(ff)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
Result is from Profile mutation Result is from Session mutation result count = 2
Async scanner (Result)
The code snippet below illustrates how to query two tables simultaneously using asynchronous scanners and a future object that returns a Result object. The APIs introduced include the async_scanner_open and async_scanner_close functions.
begin
ns = client.namespace_open("test")
ff = client.future_open(0)
ss = ScanSpec.new
ri = RowInterval.new
ri.start_row = "1"
ri.start_inclusive = true
ri.end_row = "1"
ri.end_inclusive = true
ss.row_intervals = Array[ ri ]
profile_scanner = client.async_scanner_open(ns, "Profile", ff, ss)
ss = ScanSpec.new
ri = RowInterval.new
ri.start_row = "0001-200238"
ri.start_inclusive = true
ri.end_row = "0001-200238"
ri.end_inclusive = true
ss.row_intervals = Array[ ri ]
session_scanner = client.async_scanner_open(ns, "Session", ff, ss)
while true do
result = client.future_get_result(ff, 0)
if result.is_empty
break
end
if result.is_error
puts "Async mutator error: " + result.error_msg
exit 1
end
if result.id == profile_scanner
puts "Result is from Profile scan"
elsif result.id == session_scanner
puts "Result is from Session scan"
end
result.cells.each { |cell| puts cell }
end
client.async_scanner_close(profile_scanner)
client.async_scanner_close(session_scanner)
client.future_close(ff)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
Result is from Profile scan
{Cell key={Key row=1 column_family=info column_qualifier=name flag=255} value=Joe}
{Cell key={Key row=1 column_family=last_access column_qualifier= flag=255} value=2014-06-13 16:06:09}
Result is from Session scan
{Cell key={Key row=0001-200238 column_family=user_id column_qualifier=1 flag=255} value=}
{Cell key={Key row=0001-200238 column_family=page_hit column_qualifier= flag=255} value=/index.html}
Async scanner (ResultAsArrays)
The code snippet below illustrates how to query two tables simultaneously using asynchronous scanners and a future object that returns a ResultAsArrays object. This example introduces the future_get_result_as_arrays API and makes use of the function print_cell_as_array defined in Appendix - helper functions.
begin
ns = client.namespace_open("test")
ff = client.future_open(0)
ss = ScanSpec.new
ri = RowInterval.new
ri.start_row = "1"
ri.start_inclusive = true
ri.end_row = "1"
ri.end_inclusive = true
ss.row_intervals = Array[ ri ]
profile_scanner = client.async_scanner_open(ns, "Profile", ff, ss)
ss = ScanSpec.new
ri = RowInterval.new
ri.start_row = "0001-200238"
ri.start_inclusive = true
ri.end_row = "0001-200238"
ri.end_inclusive = true
ss.row_intervals = Array[ ri ]
session_scanner = client.async_scanner_open(ns, "Session", ff, ss)
while true do
result_as_arrays = client.future_get_result_as_arrays(ff, 0)
if result_as_arrays.is_empty
break
end
if result_as_arrays.is_error
puts "Async mutator error: " + result_as_arrays.error_msg
exit 1
end
if result_as_arrays.id == profile_scanner
puts "Result is from Profile scan"
elsif result_as_arrays.id == session_scanner
puts "Result is from Session scan"
end
result_as_arrays.cells.each { |cell_as_array| print_cell_as_array(cell_as_array) }
end
client.async_scanner_close(profile_scanner)
client.async_scanner_close(session_scanner)
client.future_close(ff)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
Result is from Profile scan
{CellAsArray key={Key row=1 column_family=info column_qualifier=name} value=Joe}
{CellAsArray key={Key row=1 column_family=last_access column_qualifier=} value=2014-06-13 16:06:09}
Result is from Session scan
{CellAsArray key={Key row=0001-200238 column_family=user_id column_qualifier=1} value=}
{CellAsArray key={Key row=0001-200238 column_family=page_hit column_qualifier=} value=/index.html}
Atomic counters
This section describes how to use atomic counters. The examples assume that a Hits table has been created with the following HQL command.
CREATE TABLE Hits (count COUNTER);
Increment
The code snippet below illustrates how to increment per-second hit counts for pages of a website.
begin
ns = client.namespace_open("test")
mutator = client.mutator_open(ns, "Hits", 0, 0)
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/foo/bar.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/foo/bar.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/foo/bar.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/foo/bar.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "1"
cells.push(cell)
client.mutator_set_cells(mutator, cells)
client.mutator_flush(mutator)
scanner = client.scanner_open(ns, "Hits", ScanSpec.new)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.mutator_close(mutator)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=/foo/bar.html column_family=count column_qualifier=2014-06-14 07:31:18 flag=255} value=3}
{Cell key={Key row=/foo/bar.html column_family=count column_qualifier=2014-06-14 07:31:19 flag=255} value=1}
{Cell key={Key row=/index.html column_family=count column_qualifier=2014-06-14 07:31:18 flag=255} value=2}
{Cell key={Key row=/index.html column_family=count column_qualifier=2014-06-14 07:31:19 flag=255} value=4}
Reset and subtraction
The code snippet below illustrates how to reset and subtract from counters.
begin
ns = client.namespace_open("test")
mutator = client.mutator_open(ns, "Hits", 0, 0)
cells = Array.new
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "=0"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "7"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/foo/bar.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:18"
cell.value = "-1"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/index.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "-2"
cells.push(cell)
cell = Cell.new
cell.key = Key.new
cell.key.row = "/foo/bar.html"
cell.key.column_family = "count"
cell.key.column_qualifier = "2014-06-14 07:31:19"
cell.value = "=19"
cells.push(cell)
client.mutator_set_cells(mutator, cells)
client.mutator_flush(mutator)
scanner = client.scanner_open(ns, "Hits", ScanSpec.new)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.mutator_close(mutator)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
{Cell key={Key row=/foo/bar.html column_family=count column_qualifier=2014-06-14 07:31:18 flag=255} value=2}
{Cell key={Key row=/foo/bar.html column_family=count column_qualifier=2014-06-14 07:31:19 flag=255} value=19}
{Cell key={Key row=/index.html column_family=count column_qualifier=2014-06-14 07:31:18 flag=255} value=7}
{Cell key={Key row=/index.html column_family=count column_qualifier=2014-06-14 07:31:19 flag=255} value=2}
Creating unique keys
This section illustrates how to create unique keys using the create_cell_unique API. The example show how you can create unique user IDs for an application. The code assumes that a User table has been created with the following HQL command.
CREATE TABLE User (info, id TIME_ORDER desc MAX_VERSIONS 1);
The example code snippet is as follows.
begin
ns = client.namespace_open("test")
key = Key.new
key.column_family = "id"
key.row = "joe1987"
client.create_cell_unique(ns, "User", key, "")
key = Key.new
key.column_family = "id"
key.row = "mary.bellweather"
client.create_cell_unique(ns, "User", key, "")
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
begin
ns = client.namespace_open("test")
key = Key.new
key.column_family = "id"
key.row = "joe1987"
client.create_cell_unique(ns, "User", key, "")
client.namespace_close(ns)
rescue ClientException => e
if e.code == 48
puts "User name '%s' is already taken" % key.row
client.namespace_close(ns)
else
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
end
begin
ns = client.namespace_open("test")
scanner = client.scanner_open(ns, "User", ScanSpec.new)
cells = client.scanner_get_cells(scanner)
while !cells.empty? do
cells.each { |cell| puts cell }
cells = client.scanner_get_cells(scanner)
end
client.scanner_close(scanner)
client.namespace_close(ns)
rescue ClientException => e
puts "exception caught on line %d: %s" % [__LINE__, e.message]
exit 1
end
The following is example output produced by the above code snippet.
User name 'joe1987' is already taken
{Cell key={Key row=joe1987 column_family=id column_qualifier= flag=255} value=4156ed74-2abd-4b35-bada-cf3f35553622}
{Cell key={Key row=mary.bellweather column_family=id column_qualifier= flag=255} value=eb61bdeb-9890-4928-bb1a-dee43b1d1bd6}
Appendix - helper functions
The following helper function is used in the examples in this document.
def print_cell_as_array(cell_as_array)
puts "{CellAsArray key={Key row=%s column_family=%s column_qualifier=%s} value=%s}" %
[cell_as_array[0], cell_as_array[1], cell_as_array[2], cell_as_array[3]]
end

