INDEX
Explanations
occurrences of specific literary references or significant quotes in a text
Negative Logits
illard           -0.18
Ones             -0.15
mention          -0.15
sil              -0.14
lif              -0.14
ért              -0.13
Mention          -0.13
ä¿¡ç͍           -0.13
anky             -0.13
BarButtonItem    -0.13
Positive Logits
:↵↵↵             0.28
":↵              0.27
:↵↵              0.27
"):↵             0.26
:↵               0.26
:↵               0.25
:↵↵              0.25
':↵              0.25
):↵              0.25
):↵              0.25
Activations Density 0.116%