INDEX
Explanations
positive outcomes and expressions of satisfaction
Negative Logits
uca            -0.79
interstitial   -0.73
inian          -0.71
roots          -0.69
raid           -0.67
agus           -0.66
UME            -0.66
mber           -0.64
ricanes        -0.64
ockey          -0.63
Positive Logits
Excellent      1.01
improved       0.97
improves       0.97
congr          0.95
albeit         0.95
inexpensive    0.95
satisfactory   0.91
prosper        0.89
Awesome        0.88
improving      0.88
Activations Density: 3.465%