INDEX
Explanations
sentences or phrases with a positive emotional tone
New Auto-Interp
Negative Logits
Compare
-0.68
yz
-0.63
Rothschild
-0.62
Lauder
-0.61
..................
-0.58
Tours
-0.55
PN
-0.54
Santorum
-0.54
Compare
-0.53
Pwr
-0.52
POSITIVE LOGITS
ELF
1.16
ullivan
1.02
selves
0.97
own
0.95
lightly
0.94
pecially
0.93
ources
0.91
leeve
0.90
avior
0.90
ustainable
0.88
Activations Density 0.614%