Description

Transform data type from Kv to Csv.

Parameters

Name Description Type Required? Default Value
handleInvalid Strategy to handle unseen token String “ERROR”
reservedCols Names of the columns to be retained in the output table String[] null
csvCol Name of the CSV column String
schemaStr Formatted schema String
csvFieldDelimiter Field delimiter String “,”
quoteChar quote char Character “\””
kvCol Name of the KV column String
kvColDelimiter Delimiter used between key-value pairs when data in the input table is in sparse format String “,”
kvValDelimiter Delimiter used between keys and values when data in the input table is in sparse format String “:”

Script Example

Code

  1. import numpy as np
  2. import pandas as pd
  3. data = np.array([['1', '{"f0":"1.0","f1":"2.0"}', '$3$0:1.0 1:2.0', 'f0:1.0,f1:2.0', '1.0,2.0', 1.0, 2.0],
  4. ['2', '{"f0":"4.0","f1":"8.0"}', '$3$0:4.0 1:8.0', 'f0:4.0,f1:8.0', '4.0,8.0', 4.0, 8.0]])
  5. df = pd.DataFrame({"row":data[:,0], "json":data[:,1], "vec":data[:,2], "kv":data[:,3], "csv":data[:,4], "f0":data[:,5], "f1":data[:,6]})
  6. data = dataframeToOperator(df, schemaStr="row string, json string, vec string, kv string, csv string, f0 double, f1 double",op_type="stream")
  7. op = KvToCsvStreamOp()\
  8. .setKvCol("kv")\
  9. .setReservedCols(["row"]).setCsvCol("csv").setSchemaStr("f0 double, f1 double")\
  10. .linkFrom(data)
  11. op.print()
  12. StreamOperator.execute()

Results

row csv
1 1.0,2.0
2 4.0,8.0