INDEX
Explanations
strong emotional or decisive actions or statements related to various situations
words related to actions or processes commonly associated with criminal or contentious contexts
New Auto-Interp
Negative Logits
Disclaimer
-0.73
absence
-0.69
aisle
-0.68
identification
-0.68
divergence
-0.67
takeaway
-0.66
implication
-0.66
variation
-0.64
objective
-0.63
outcome
-0.62
POSITIVE LOGITS
itates
1.03
ifies
1.02
izes
1.00
ously
0.92
posed
0.90
aciously
0.88
cedes
0.88
isively
0.88
anted
0.87
igrated
0.86
Activations Density 0.362%