INDEX
Explanations
phrases related to societal issues and observations about ongoing events
statements regarding social or economic crises
New Auto-Interp
Negative Logits
Contributions
-0.66
ocular
-0.65
Feather
-0.63
Romance
-0.63
sincerity
-0.63
validity
-0.62
promotional
-0.61
Packs
-0.61
Revel
-0.60
ulence
-0.59
POSITIVE LOGITS
witnessing
1.29
incarcer
1.03
witnessed
0.92
witness
0.91
seeing
0.89
electing
0.89
hower
0.83
overdue
0.81
faced
0.80
akening
0.80
Activations Density 0.224%