INDEX
Explanations
references to specific people and places
mentions of specific organizations or entities
Negative Logits
UL              -0.93
Bucs            -0.85
Tul             -0.82
Tulsa           -0.76
Trinidad        -0.75
atri            -0.75
adoes           -0.73
Tab             -0.73
tuber           -0.73
Jacksonville    -0.72
Positive Logits
Me              2.05
Me              1.79
ME              1.72
ME              1.56
me              1.53
Meh             1.40
Meow            1.32
Megan           1.26
Mek             1.13
Mei             1.04
Activations Density: 0.175%