INDEX
Explanations
phrases related to different forms of lying
instances of the word "lie" in various contexts
Negative Logits
connecting    -0.65
pip           -0.63
access        -0.61
Vital         -0.60
tabs          -0.59
rounded       -0.59
stats         -0.59
pool          -0.59
upgraded      -0.59
scheduled     -0.59
Positive Logits
lie           5.00
lies          2.16
Lie           1.66
lied          1.46
lia           1.44
lio           1.43
li            1.42
Lie           1.39
lying         1.23
leigh         1.19
Activations Density 0.011%
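
For orientation, a density figure like the one above is usually read as the share of tokens on which the feature fires at all. The sketch below shows one way such a percentage could be computed. It assumes "fires" means an activation strictly above a threshold of zero; the function name `activation_density` and the toy data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as active when its activation is strictly
    greater than the threshold. The dashboard's exact definition is not
    stated in the source.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical toy data: 10,000 tokens, exactly one of which activates.
toy = [0.0] * 9999 + [3.2]
print(f"{activation_density(toy):.3f}%")
```

A value as low as 0.011% would mean the feature is active on roughly one token in nine thousand, which fits a feature tied to a single word family.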