INDEX
Explanations
phrases related to being true to something or someone
references to truthfulness and being true to oneself or others
New Auto-Interp
Negative Logits
lain
-0.73
hoops
-0.68
ioch
-0.66
nets
-0.65
adelphia
-0.64
opens
-0.64
Frazier
-0.63
Regions
-0.63
zan
-0.63
ypes
-0.62
POSITIVE LOGITS
Jane
0.72
believer
0.70
ãĤ·ãĥ£
0.66
dfx
0.66
DonaldTrump
0.65
verend
0.65
Valkyrie
0.62
ACTED
0.62
mington
0.62
Unlimited
0.60
Activations Density 0.084%