INDEX
Explanations
phrases where something is being criticized or dismissed as unworthy or untrue
instances of dismissal or rejection regarding various statements or agreements
Negative Logits
rolet            -0.86
OTAL             -0.73
rol              -0.67
width            -0.67
rone             -0.66
rous             -0.65
ppa              -0.63
PsyNetMessage    -0.62
atl              -0.62
Atk              -0.62
Positive Logits
well             0.98
opposed          0.91
pired            0.90
soon             0.89
follows          0.89
pires            0.84
phy              0.80
criptions        0.79
well             0.78
ylum             0.77
Activations Density 0.147%
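
For orientation, the density figure above can be read as the share of token positions on which the feature fires at all. The sketch below illustrates that reading under stated assumptions: the function name `activation_density`, the zero threshold, and the toy data are all hypothetical and are not taken from the dashboard's own code.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a position counts as "active" when its activation is
    strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 3 active positions out of 2000 tokens.
toy = [0.0] * 2000
toy[10], toy[500], toy[1999] = 1.2, 0.4, 3.1

density = activation_density(toy)
print(f"{density:.3%}")
```

A value as low as the one reported here means the feature is sparse: it is silent on the large majority of tokens and fires only in a narrow set of contexts.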