INDEX
Explanations
phrases related to statements or opinions made by specific individuals
the word "on."
New Auto-Interp
Negative Logits
AMA
-0.77
ëĭ
-0.73
INESS
-0.71
RL
-0.68
reference
-0.67
orically
-0.66
ACT
-0.65
Mini
-0.63
abo
-0.62
9999
-0.61
POSITIVE LOGITS
behalf
1.51
etime
1.11
steroids
1.10
shore
0.99
occasion
0.94
slaught
0.90
etheless
0.90
eness
0.88
weekends
0.87
eworld
0.83
Activations Density 0.193%