INDEX
Explanations
mentions of legal or political issues
expressions of hope and resilience in challenging situations
New Auto-Interp
Negative Logits
Desktop
-0.61
uum
-0.61
Cor
-0.61
illes
-0.60
ĪĴ
-0.59
endars
-0.58
Howe
-0.58
Greenwood
-0.57
ACC
-0.57
burgh
-0.57
POSITIVE LOGITS
inois
0.79
iss
0.73
ppo
0.71
uca
0.69
Reviewer
0.68
olean
0.61
Quote
0.61
pokemon
0.61
Posted
0.60
Logged
0.60
Activations Density 0.327%