INDEX
Explanations
words related to exaggeration or overstating information
terms related to exaggeration or overstated claims
New Auto-Interp
Negative Logits
spot
-0.84
NetMessage
-0.75
aining
-0.71
abiding
-0.70
washer
-0.68
³³³³
-0.67
tha
-0.67
avis
-0.66
cellent
-0.66
shed
-0.65
POSITIVE LOGITS
exagger
1.25
exaggeration
1.11
exaggerated
0.94
overest
0.88
inflated
0.88
distortions
0.81
modesty
0.79
guiActiveUn
0.77
200000
0.76
tales
0.75
Activations Density 0.021%