INDEX
Explanations
phrases indicating tense, high-stress situations or events
references to tense situations or conditions that are complex and require careful handling
Negative Logits
Token        Logit
bern         -0.84
ervative     -0.81
onest        -0.78
hirt         -0.75
ervatives    -0.75
leeve        -0.74
tein         -0.73
atron        -0.70
itte         -0.70
pees         -0.69
Positive Logits
Token        Logit
unfolding    0.79
occurring    0.76
happening    0.75
turbulence   0.72
situations   0.72
sit          0.71
Situation    0.71
witnessed    0.71
heightened   0.68
unfold       0.67
Activations Density: 0.551%
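
The density figure can be illustrated with a minimal sketch. It assumes that "activations density" means the fraction of sampled tokens on which the feature's activation is above zero; the source does not define the term, so that reading is an assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (tokens with activation > threshold) / (all tokens).
    The dashboard's exact definition is not given in the source.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, 5 of which activate the feature.
sample = [0.0] * 995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.551% would mean the feature fires on roughly 5 or 6 tokens out of every 1000 sampled.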