INDEX
Explanations
proper nouns related to individuals and their roles or performances
phrases indicating continuous actions or states related to performance
New Auto-Interp
Negative Logits
Feminist
-0.73
reviewer
-0.70
Orwell
-0.69
Jou
-0.68
bureaucr
-0.68
censor
-0.66
Apart
-0.66
microsoft
-0.65
Lauder
-0.65
Capitalism
-0.65
POSITIVE LOGITS
preseason
0.95
teammates
0.91
batted
0.88
bounced
0.88
undrafted
0.87
Coach
0.86
cussion
0.85
teammate
0.84
coached
0.83
juries
0.83
Activations Density 0.356%