INDEX
Explanations
short phrases or sentences that allude to negative aspects or events
Negative Logits
Token      Logit
Cash       -0.69
Pharm      -0.63
Tes        -0.62
Weather    -0.60
Medical    -0.60
Kar        -0.58
eli        -0.57
Trust      -0.57
Kill       -0.56
Virgin     -0.56
Positive Logits
Token          Logit
designation    0.76
signifies      0.76
entails        0.74
altogether     0.73
anyway         0.73
refers         0.72
equals         0.71
implies        0.71
=              0.70
exists         0.69
Activations Density: 5.414%
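For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed, assuming it means the fraction of token positions on which the feature's activation is above zero. The function name activation_density, the threshold parameter, and the sample values are hypothetical illustrations and are not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical per-token activations for one feature: 20 tokens, 2 of them active.
sample = [0.0] * 20
sample[2] = 1.3
sample[19] = 0.2

print(f"{activation_density(sample):.3%}")  # prints 10.000%
```

Under that assumption, a density of 5.414% would mean the feature is active on roughly one token in eighteen of the text it was evaluated on.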