INDEX
Explanations
expressions or mentions of gratitude towards support
expressions of gratitude and mentions of support
New Auto-Interp
Negative Logits
ãĥ£
-0.70
kered
-0.66
vern
-0.63
pores
-0.63
iren
-0.63
Hebdo
-0.63
sweat
-0.62
contrad
-0.61
dare
-0.60
unbeliev
-0.60
POSITIVE LOGITS
Support
0.78
support
0.77
hesis
0.75
heses
0.75
orship
0.75
bands
0.75
Supports
0.75
enza
0.74
asio
0.72
ament
0.71
Activations Density 0.051%