INDEX
Explanations
specific names or terms of individuals or organizations containing "Lie" in them
occurrences of the word "Lie" in various contexts
New Auto-Interp
Negative Logits
arthy
-0.77
icable
-0.74
akra
-0.72
ahime
-0.72
orsi
-0.68
illed
-0.66
oulos
-0.66
icans
-0.65
smart
-0.65
anted
-0.65
POSITIVE LOGITS
utenant
1.34
Lie
1.24
Lie
1.06
uten
0.99
lie
0.91
ge
0.85
gey
0.83
itle
0.81
pard
0.81
detector
0.80
Activations Density 0.010%