Regression - IsotonicRegression - 《Alink v1.0.1 Document》

Description
Parameters
Script Example
- Code
- Results
  - Model
  - Prediction

Description

Isotonic Regression. Implement parallelized pool adjacent violators algorithm. Support single feature input or vector input(extractor one index of the vector).

Parameters

Name	Description	Type	Required？	Default Value
featureCol	Name of the feature column。	String		null
isotonic	If true, the output sequence should be increasing!	Boolean		true
featureIndex	Feature index in the vector.	Integer		0
labelCol	Name of the label column in the input table	String	✓
weightCol	Name of the column indicating weight	String		null
vectorCol	Name of a vector column	String		null
predictionCol	Column name of prediction.	String	✓

Script Example

Code

data = np.array([[0.35, 1],\
[0.6, 1],\
[0.55, 1],\
[0.5, 1],\
[0.18, 0],\
[0.1, 1],\
[0.8, 1],\
[0.45, 0],\
[0.4, 1],\
[0.7, 0],\
[0.02, 1],\
[0.3, 0],\
[0.27, 1],\
[0.2, 0],\
[0.9, 1]])
df = pd.DataFrame({"feature" : data[:,0], "label" : data[:,1]})
data = dataframeToOperator(df, schemaStr="label double, feature double",op_type="batch")
res = IsotonicRegression()\
            .setFeatureCol("feature")\
            .setLabelCol("label").setPredictionCol("result")
res.fit(data).transform(data).collectToDataframe()

Results

Model

model_id	model_info
0	{“vectorCol”:”\”col2\””,”featureIndex”:”0”,”featureCol”:null}
1048576	[0.02,0.3,0.35,0.45,0.5,0.7]
2097152	[0.5,0.5,0.6666666865348816,0.6666666865348816,0.75,0.75]

Prediction

col1	col2	col3	pred
1.0	0.9	1.0	0.75
0.0	0.7	1.0	0.75
1.0	0.35	1.0	0.6666666865348816
1.0	0.02	1.0	0.5
1.0	0.27	1.0	0.5
1.0	0.5	1.0	0.75
0.0	0.18	1.0	0.5
0.0	0.45	1.0	0.6666666865348816
1.0	0.8	1.0	0.75
1.0	0.6	1.0	0.75
1.0	0.4	1.0	0.6666666865348816
0.0	0.3	1.0	0.5
1.0	0.55	1.0	0.75
0.0	0.2	1.0	0.5
1.0	0.1	1.0	0.5