INDEX
Explanations
references to specific schools and locations
New Auto-Interp
Negative Logits
ofire
-0.17
GANG
-0.16
Garrison
-0.16
Straw
-0.15
Tort
-0.15
oggler
-0.15
inue
-0.15
Baz
-0.14
PEN
-0.14
Fond
-0.14
POSITIVE LOGITS
Worcester
0.23
Auburn
0.18
Telegram
0.18
telegram
0.17
Ass
0.17
ehr
0.17
Leicester
0.17
Holden
0.16
Clinton
0.16
Dudley
0.16
Activations Density 0.009%