INDEX
Explanations
positive sentiments towards certain actions or behaviors
words indicating acceptance and appreciation
Negative Logits
orio           -0.62
Milky          -0.59
guyen          -0.58
ouf            -0.54
coordinates    -0.53
ierre          -0.53
occupancy      -0.53
Delay          -0.52
lua            -0.52
iencies        -0.51
Positive Logits
by                  1.26
By                  0.93
universally         0.92
by                  0.91
widely              0.91
enthusiastically    0.90
everywhere          0.90
amongst             0.89
internationally     0.89
nationally          0.88
Activations Density: 0.219%