INDEX
Explanations
names of universities or colleges
proper nouns, specifically names of people or characters
New Auto-Interp
Negative Logits
ilver
-0.76
arily
-0.74
*/(
-0.73
egal
-0.70
itized
-0.70
keys
-0.69
agric
-0.68
matic
-0.68
eers
-0.67
ryu
-0.64
POSITIVE LOGITS
aneers
0.86
Cox
0.80
ruary
0.77
oken
0.76
Myers
0.75
Kitt
0.75
ulia
0.72
ards
0.72
Slater
0.71
Bry
0.71
Activations Density 0.038%