INDEX
Explanations
the word "Lie" or variations of it
occurrences of the word "Lie" and variations of the name "Lieberman"
New Auto-Interp
Negative Logits
uploads
-0.71
ahime
-0.71
arthy
-0.71
runs
-0.67
anted
-0.67
liking
-0.66
orsi
-0.65
ocked
-0.65
oulos
-0.65
âĶģ
-0.64
POSITIVE LOGITS
Lie
1.55
utenant
1.38
Lie
1.31
itle
0.93
lie
0.89
glers
0.84
gey
0.84
uten
0.82
ppel
0.82
zen
0.82
Activations Density 0.010%