INDEX
Explanations
phrases indicating strong opinions or evaluations
phrases indicating the significance or impact of events or statements
New Auto-Interp
Negative Logits
anus
-0.64
celebrated
-0.63
consulted
-0.59
itable
-0.57
uled
-0.56
prototypes
-0.55
oided
-0.55
anz
-0.54
schild
-0.54
renovations
-0.54
POSITIVE LOGITS
emi
0.72
argo
0.65
Jr
0.65
HO
0.64
credibility
0.64
causation
0.62
alot
0.62
â̦)
0.61
Uk
0.61
cred
0.61
Activations Density 0.534%