INDEX
Explanations
phrases indicating a conclusion or final thought
conclusive statements or summaries at the end of paragraphs
New Auto-Interp
Negative Logits
mong
-0.66
dec
-0.62
ocl
-0.60
ga
-0.58
ung
-0.57
lis
-0.57
folk
-0.56
lands
-0.56
anim
-0.56
umerable
-0.56
POSITIVE LOGITS
icia
0.84
reunited
0.78
hiba
0.72
icates
0.69
elvet
0.69
oner
0.67
CoC
0.65
aire
0.65
icating
0.64
conclud
0.64
Activations Density 0.021%