INDEX
Explanations
references to the word "nerd" or similar variations
references to individuals named "Ner" or "Ud" as well as variations of "nerd"
New Auto-Interp
Negative Logits
achusetts
-0.78
croft
-0.75
Dispatch
-0.74
Hopkins
-0.74
auga
-0.73
jurisdictions
-0.73
Kasich
-0.71
ct
-0.71
sweep
-0.71
Phillips
-0.70
POSITIVE LOGITS
Ner
4.19
ner
2.65
Nerd
1.51
nerds
1.33
Ud
1.33
nerd
1.30
Fey
1.21
Pau
1.15
Nou
1.06
darts
1.04
Activations Density 0.033%