INDEX
Explanations
words related to evaluation or judgment
phrases and terms related to the concept of rating or evaluation
New Auto-Interp
Negative Logits
amar
-0.75
undone
-0.72
auga
-0.64
INS
-0.64
antis
-0.63
ords
-0.63
peria
-0.62
imura
-0.61
itars
-0.61
aina
-0.60
POSITIVE LOGITS
rating
1.04
ration
0.95
rations
0.89
rated
0.88
rent
0.76
mble
0.75
senal
0.74
secution
0.73
theless
0.72
hyde
0.72
Activations Density 0.021%