INDEX
Explanations
negative numerical values associated with various measurements or scores
New Auto-Interp
Negative Logits
✨:
-0.95
"@/
-0.92
Wenger
-0.88
évaluateur
-0.86
fatis
-0.85
purpoſe
-0.85
reaſon
-0.84
raiſ
-0.82
Tallahassee
-0.81
juſt
-0.81
POSITIVE LOGITS
)−
1.05
−
1.05
−
0.95
(−
0.82
(−
0.73
=−
0.70
],
0.69
ley
0.69
̃o
0.67
Moreno
0.66
Activations Density 0.028%