INDEX
Explanations
mentions of differing opinions or perspectives
repeated references to the term "Others."
New Auto-Interp
Negative Logits
both
-0.71
just
-0.69
basically
-0.67
ticket
-0.66
level
-0.64
peak
-0.64
pretty
-0.63
utra
-0.60
every
-0.59
attendance
-0.59
POSITIVE LOGITS
Others
3.85
Others
2.72
Other
1.90
Another
1.43
Other
1.37
others
1.36
Many
1.25
Someone
1.23
Some
1.21
Their
1.16
Activations Density 0.016%