INDEX
Explanations
specific verbs and adjectives related to formal designations or classifications
terms related to official designations and classifications
Negative Logits
Token      Logit
perty      -0.63
elin       -0.63
uv         -0.62
vous       -0.62
onse       -0.61
umbn       -0.61
Spread     -0.61
hire       -0.60
otrop      -0.59
abama      -0.58
Positive Logits
Token      Logit
ifiable    0.74
marked     0.68
phas       0.67
zones      0.63
ependent   0.63
icut       0.63
classify   0.63
ocument    0.62
enemy      0.62
Crit       0.61
Activations Density: 0.107%