INDEX
Explanations
phrases related to commands or instructions
punctuation marks at the end of sentences
Negative Logits
ibur          -0.97
sustained     -0.80
gunned        -0.71
sustaining    -0.70
grip          -0.70
choking       -0.69
bush          -0.67
choked        -0.67
dere          -0.66
meddling      -0.65
POSITIVE LOGITS
Each            1.60
Depending       1.52
Usually         1.44
Ideally         1.39
Afterwards      1.37
Additionally    1.36
Typically       1.36
Then            1.35
Alternatively   1.35
Otherwise       1.32
Activations Density 0.411%
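
As a rough illustration of how a density figure like the one above is usually read: on dashboards of this kind it is typically the fraction of sampled tokens on which the feature's activation is above zero. The sketch below assumes that definition; the function name `activation_density` and the `threshold` parameter are hypothetical and are not taken from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (number of active tokens) / (number of tokens sampled).
    This is an illustrative definition, not one confirmed by the page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy input: 1 active token out of 4 sampled tokens.
density = activation_density([0.0, 0.0, 0.0, 1.5])
print(f"{density:.3%}")
```

Under this reading, a density of 0.411% would mean the feature fires on roughly 4 of every 1,000 sampled tokens.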