INDEX
Explanations
references and mentions of concepts or items in the text
Negative Logits
Token       Logit
parker      -0.69
guts        -0.62
alá         -0.62
Kamil       -0.60
recevrez    -0.56
Wei         -0.56
Wal         -0.56
STL         -0.56
livers      -0.55
Ellie       -0.55
Positive Logits
Token       Logit
Mention     1.76
mention     1.74
mentions    1.71
Mentions    1.68
mentioning  1.65
Mention     1.63
mention     1.56
mentioned   1.55
Mentioned   1.46
mentions    1.42
Activations Density 0.047%
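
As a rough illustration of what a density figure like this could mean, the sketch below assumes that activation density is the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; they are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumed definition for illustration; the dashboard may compute it differently.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 5 of which activate the feature.
sample = [0.0] * 9995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.050%
```

Under this reading, a density of 0.047% would correspond to the feature firing on roughly 47 of every 100,000 tokens.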