INDEX
Explanations
references to specific individuals and their roles or contributions in a narrative context
New Auto-Interp
Negative Logits
bore
-0.14
anine
-0.14
aget
-0.14
Glob
-0.14
Cold
-0.14
adge
-0.13
Dil
-0.13
Proud
-0.13
izzas
-0.13
witness
-0.13
POSITIVE LOGITS
lify
0.15
ikel
0.14
cano
0.14
Ïģκ
0.14
³
0.14
ificio
0.14
itchens
0.13
495
0.13
iking
0.13
ulary
0.13
Activations Density 0.012%