INDEX
Explanations
phrases that mention something "real" or related to reality
mentions of "real" in various contexts
Negative Logits
hire       -0.73
Supported  -0.72
served     -0.71
theless    -0.71
Ħ¢         -0.68
recated    -0.68
attr       -0.68
ACK        -0.68
ilan       -0.65
illet      -0.65
Positive Logits
isation    1.21
ignment    1.16
culprit    1.14
estate     1.09
reason     0.99
igning     0.98
igned      0.98
kicker     0.97
McCoy      0.93
polit      0.91
Activations Density 0.032%