INDEX
Explanations
phrases related to discussions or arguments
sentences that convey finality or conclusions
New Auto-Interp
Negative Logits
quir
-0.81
unsus
-0.81
swiftly
-0.80
hastily
-0.74
furiously
-0.74
brightly
-0.74
discern
-0.73
satell
-0.73
subur
-0.72
disemb
-0.72
POSITIVE LOGITS
And
1.51
Somebody
1.49
Obviously
1.47
Everybody
1.47
Because
1.44
So
1.44
Whereas
1.42
That
1.41
Yeah
1.38
They
1.38
Activations Density 0.320%