INDEX
Explanations
proper nouns
occurrences of the word "Ge" and its various forms
New Auto-Interp
Negative Logits
Interstitial
-0.92
ioned
-0.82
æł
-0.75
netflix
-0.73
INGTON
-0.73
ABE
-0.72
kefeller
-0.70
sburgh
-0.69
nesses
-0.69
ividual
-0.69
POSITIVE LOGITS
ffen
1.06
cko
1.04
ORGE
1.00
elong
0.96
orgetown
0.96
ographical
0.95
orget
0.95
og
0.86
gebra
0.84
ometry
0.84
Activations Density 0.011%