INDEX
Explanations
words related to unusual or out-of-the-ordinary situations or individuals
instances of the word "freak" and its variations
Negative Logits
adr           -0.82
ournal        -0.80
eger          -0.74
arta          -0.74
ea            -0.71
rity          -0.68
sein          -0.67
dated         -0.65
akespeare     -0.64
atana         -0.64
Positive Logits
ishly         1.18
ously         0.96
fuck          0.82
freak         0.81
istically     0.80
ulously       0.78
onom          0.78
bum           0.76
holes         0.75
hole          0.74
Activations Density 0.010%