INDEX
Explanations
words related to verifying or confirming information
Negative Logits
onew     -0.79
enium    -0.75
psc      -0.68
bler     -0.66
theme    -0.66
neys     -0.64
cific    -0.63
olit     -0.61
anie     -0.61
ð        -0.60
Positive Logits
authenticity    0.94
suspicions      0.94
receipt         0.92
ations          0.84
orship          0.83
confirmation    0.79
sighting        0.78
atively         0.78
validity        0.78
rumours         0.76
Activations Density 0.616%
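
The density figure above is commonly read as the fraction of token positions on which the feature activates. The page does not state how it was computed, so the following is only a minimal sketch under that assumption; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 6 of which have a nonzero activation.
sample = [0.0] * 994 + [0.3, 1.2, 0.7, 2.1, 0.05, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.616% would correspond to the feature firing on roughly 6 of every 1000 tokens.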