INDEX
Explanations
phrases related to the concept of the truth or critical assessment of situations
instances of the word "lie" and its variations, indicating discussions of hidden truths or deceptions
New Auto-Interp
Negative Logits
aldi
-0.81
aud
-0.79
obs
-0.75
iles
-0.71
ilar
-0.69
smart
-0.68
ains
-0.66
inky
-0.66
ISO
-0.65
ilic
-0.64
POSITIVE LOGITS
uten
0.95
utenant
0.82
detector
0.81
lie
0.79
Lies
0.77
dormant
0.72
vulner
0.72
Reincarn
0.71
lie
0.67
ppe
0.66
Activations Density 0.011%