INDEX
Explanations
positive sentiments and expressions of gratitude
Negative Logits
WWF           -0.66
ancel         -0.61
Greenpeace    -0.59
ammy          -0.57
mediation     -0.56
Griffin       -0.56
Emin          -0.53
winner        -0.53
ouses         -0.53
erman         -0.52
Positive Logits
Magicka       0.69
":[           0.64
Bi            0.63
."[           0.61
bler          0.60
dracon        0.59
enth          0.59
physically    0.59
igr           0.58
cells         0.58
Activations Density 1.220%