INDEX
Explanations
phrases that indicate personal impact or experience related to societal issues
impact or affect
New Auto-Interp
Negative Logits
idi
-0.31
spit
-0.29
classnames
-0.28
kla
-0.28
honors
-0.28
ESC
-0.28
gave
-0.26
onCreateView
-0.26
super
-0.25
performed
-0.25
POSITIVE LOGITS
impact
1.05
affect
1.02
impact
0.98
affects
0.98
Impact
0.97
affected
0.96
Impact
0.95
impacts
0.95
Affected
0.95
affect
0.95
Activations Density 0.111%