INDEX
Explanations
proper nouns or names
mentions of the name "Tim."
New Auto-Interp
Negative Logits
Magikarp
-0.77
Reloaded
-0.69
prolifer
-0.61
traged
-0.61
permanent
-0.61
ATS
-0.60
poaching
-0.59
UGE
-0.58
Untitled
-0.57
UGH
-0.57
POSITIVE LOGITS
estamp
1.37
othy
1.34
eless
1.31
bre
1.14
mons
1.13
elines
1.12
my
1.08
eline
1.05
oleon
1.02
oshenko
0.99
Activations Density 0.040%