Explanations
phrases related to expressing strong emotion or urgency
occurrences of quoted speech
Negative Logits
hardened        -0.83
affected        -0.80
targeted        -0.80
hosts           -0.80
frontline       -0.80
isolated        -0.79
distinguished   -0.78
backlog         -0.76
foreground      -0.76
vigil           -0.76
Positive Logits
Hey             1.78
Fuck            1.73
Oh              1.67
Yeah            1.66
hello           1.62
Hello           1.60
yeah            1.60
fuck            1.60
sorry           1.59
please          1.59
Activations Density 0.112%