INDEX
Explanations
proper nouns that start with a capital letter followed by a high concentration of capital letters within the token
proper nouns and specific titles
New Auto-Interp
Negative Logits
aisle
-0.72
carriers
-0.71
carrier
-0.70
stalk
-0.69
trademark
-0.63
controllers
-0.63
generals
-0.62
corners
-0.61
pedal
-0.60
controller
-0.60
POSITIVE LOGITS
acia
0.73
ild
0.69
izont
0.67
nergy
0.66
OTOS
0.66
Op
0.66
UE
0.66
orama
0.66
Keys
0.66
obal
0.65
Activations Density 0.255%