Google Prediction API always returns a score of 1.0 or 0.0 -
i'm using the insert function of google-api-ruby-client , passing in array of traininginstances.
when call predict used scores 0.653264 , 0.346736 (using 2 output categories).
after doing refactoring scores of 1.0 , 0.0 , no probabilities in between.
the prediction accuracy (measured me) still close accuracy given status function looks trained model working. want know why i'm not getting scores in between 1 , 0 use in application.
any insight appreciated.
here sample of training data i'm using:
"completed registration",32.54,2,0,1,12 "completed registration",27.05,2,0,1,7 "completed registration",27.29,1,0,3,7 "completed registration",24.15,1,0,1,6 "completed registration",26.36,2,0,1,6 "completed registration",27.0,3,1,3,5 "completed registration",22.15,3,1,3,5 "completed registration",27.9,2,1,1,1 "completed registration",21.21,2,0,3,0 "completed registration",24.36,3,0,3,0 "completed registration",53.52,2,1,3,0 "completed registration",24.15,8,0,3,-6 "cancelled registration",,2,0,3,76 "cancelled registration",,2,0,3,76 "cancelled registration",,3,1,3,69 "cancelled registration",38.27,5,0,1,63 "cancelled registration",37.25,2,0,3,50 "cancelled registration",35.59,5,1,3,49 "cancelled registration",19.52,2,0,3,30 "cancelled registration",26.7,2,0,3,28
i've run tests on google api explorer , uploaded training data in csv file.
the data first uploaded (a few weeks ago) results in range of scores. exact same data uploaded today results in scores of 1 or 0.
could character encoding problem?
in case other people have same question i'm posting answer here. turns out internally api can use different classifiers depending on training data. looks difference caused change api , not refactoring.
here explanation @ google https://groups.google.com/forum/#!topic/prediction-api-discuss/7qphonkhzjc
Comments
Post a Comment