INDEX
Explanations
various forms of the word "mention" and its related context in texts
New Auto-Interp
Negative Logits
parker
-0.63
Wal
-0.59
owa
-0.57
不懂
-0.56
Wy
-0.54
Wal
-0.54
guts
-0.53
Kamil
-0.51
Parr
-0.51
Gwyn
-0.50
POSITIVE LOGITS
Mention
2.25
mention
2.23
mentions
2.17
mentioning
2.17
Mention
2.10
Mentions
2.03
mention
2.02
mentioned
1.99
Mentioned
1.90
mentions
1.71
Activations Density 0.056%