INDEX
Explanations
information or details that bring about surprise or a significant new understanding
terms related to significant disclosures or surprises
New Auto-Interp
Negative Logits
animate
-0.79
ucky
-0.74
osh
-0.70
chn
-0.70
resp
-0.70
cium
-0.69
cul
-0.69
acci
-0.69
veyard
-0.68
osuke
-0.67
POSITIVE LOGITS
revelation
1.27
revelations
1.27
Revelations
0.87
reveals
0.83
Leaks
0.82
revealed
0.81
disclosures
0.76
aloud
0.76
reveal
0.74
loopholes
0.71
Activations Density 0.011%