INDEX
Explanations
phrases indicating multiple individuals along with negative or impactful events
mentions of individuals and their involvement in various incidents or events
New Auto-Interp
Negative Logits
Temperature
-0.72
Natural
-0.67
tnc
-0.67
Parables
-0.66
à¤
-0.66
ãĤ¦
-0.66
Leader
-0.64
osterone
-0.62
itiveness
-0.61
Order
-0.61
POSITIVE LOGITS
apiece
1.02
accused
0.97
implicated
0.94
boarded
0.93
died
0.82
indicted
0.81
abducted
0.80
whom
0.78
parach
0.78
suspected
0.77
Activations Density 0.197%