Explanations
instances of verbs related to actions and consequences
phrases that indicate fear or concern regarding potential risks or threats
Negative Logits
cember      -0.83
Tycoon      -0.75
çͰ          -0.74
borgh       -0.73
UFC         -0.72
rican       -0.71
Donald      -0.69
Rated       -0.69
razil       -0.67
usterity    -0.67
Positive Logits
theirs       0.93
nuance       0.91
such         0.87
specifics    0.86
caveats      0.86
particulars  0.85
things       0.84
insights     0.82
anything     0.80
broader      0.80
Activations Density: 0.937%