INDEX
Explanations
phrases starting with a speaker's continuation of a previous statement
repetitive phrases or structures in dialogue or narrative
Negative Logits
wcsstore      -0.81
ãĥİ           -0.71
Frameworks    -0.71
ousands       -0.70
eros          -0.70
ĪĴ            -0.67
arta          -0.67
icial         -0.63
士            -0.62
ICAN          -0.61
Positive Logits
sarcast       1.09
omin          0.96
stating       0.95
summar        0.92
noting        0.90
quoting       0.90
stressing     0.88
reiter        0.87
:]            0.87
bluntly       0.85
Activations Density: 0.112%