INDEX
Explanations
names of political figures
New Auto-Interp
Negative Logits
rake
-0.61
Fram
-0.61
Ĥª
-0.61
adolesc
-0.59
FSA
-0.58
Detailed
-0.57
aggregation
-0.57
context
-0.57
Roundup
-0.57
bryce
-0.56
POSITIVE LOGITS
vu
0.78
emort
0.74
warm
0.73
schild
0.72
anamo
0.71
ilver
0.71
iors
0.69
issance
0.69
gettable
0.68
enstein
0.68
Activations Density 0.152%