Evaluation
Compare
Pairwise comparison for preference-based training (DPO). Compare two responses and determine which is better.
POST
cURL
Authorizations
API key obtained from Labs Portal
Body
application/json
Response
Success
Which response won: a, b, or tie
Available options:
a, b, tie Reward score for response_a from 0 to 1
Required range:
0 <= x <= 1Reward score for response_b from 0 to 1
Required range:
0 <= x <= 1Absolute difference between reward_a and reward_b
Version of the scoring algorithm used
cURL