INDEX
Explanations
mentions of challenging or problematic situations
references to a specific situation being described in the text
New Auto-Interp
Negative Logits
rib
-0.86
ithe
-0.83
rotein
-0.81
ighters
-0.81
icket
-0.81
gling
-0.78
rik
-0.76
cott
-0.76
rica
-0.75
inker
-0.75
POSITIVE LOGITS
Situation
1.09
situation
0.98
situations
0.89
unfolding
0.87
Crisis
0.84
unfold
0.82
circumstances
0.79
Danger
0.76
Intervention
0.74
involving
0.74
Activations Density 0.024%