INDEX
Explanations
numerical values indicating statistics or measurements
monetary values and statistics
Negative Logits
anship     -0.80
rity       -0.68
lace       -0.67
anners     -0.63
icably     -0.63
athered    -0.62
ariat      -0.61
onomy      -0.60
atl        -0.60
ilyn       -0.60
Positive Logits
istg          0.78
depending     0.70
@@            0.70
ptin          0.65
Fiat          0.65
Prediction    0.64
Divide        0.63
Sakuya        0.62
Xie           0.62
Chimera       0.61
Activations Density: 0.118%