INDEX
Explanations
discussion or critique about politicians and political events
expressions of strong emotions and significant experiences
New Auto-Interp
Negative Logits
TEAM
-0.71
clusive
-0.70
Phase
-0.69
OG
-0.68
Formation
-0.66
Documents
-0.65
ABV
-0.65
senal
-0.64
BAT
-0.64
Coverage
-0.64
POSITIVE LOGITS
doubtless
1.20
pity
1.18
understandably
1.15
nostalg
1.14
scorn
1.12
surely
1.11
unconsciously
1.10
wonder
1.09
lament
1.08
subconscious
1.08
Activations Density 0.962%