INDEX
Explanations
sentences or phrases involving direct speech
instances of spoken dialogue or direct speech
New Auto-Interp
Negative Logits
folio
-0.83
abase
-0.80
Ranked
-0.78
unning
-0.71
mite
-0.69
osponsors
-0.68
ardless
-0.68
idious
-0.67
imates
-0.66
resorts
-0.66
POSITIVE LOGITS
loudly
1.21
aloud
1.10
hello
1.04
plaint
0.98
Goodbye
0.96
goodbye
0.96
softly
0.95
angrily
0.90
unint
0.90
calmly
0.84
Activations Density 0.233%