INDEX
Explanations
references to geek culture and interests
Negative Logits
Token                 Logit
chamber               -0.15
noop                  -0.15
oley                  -0.15
ÑĩеÑĢ                  -0.14
Ngh                   -0.14
(no visible token)    -0.14
è«                    -0.14
plit                  -0.14
locale                -0.13
olley                 -0.13
Positive Logits
Token    Logit
ner      0.39
Ner      0.35
nerd     0.34
ge       0.32
ner      0.32
geek     0.32
/ge      0.29
NER      0.28
ge       0.27
Geek     0.26
Activations Density: 0.083%