INDEX
Explanations
occurrences of the word "show" and its variations
New Auto-Interp
Negative Logits
:✨
-0.36
plegable
-0.34
kẻ
-0.30
フロ
-0.30
beloved
-0.30
EconPapers
-0.29
dintre
-0.29
зовут
-0.29
principalTable
-0.28
politics
-0.28
POSITIVE LOGITS
showing
2.11
showing
2.02
Showing
1.98
Showing
1.85
SHOWING
1.72
showed
1.57
showed
1.55
mostrando
1.47
montrant
1.44
shown
1.35
Activations Density 0.218%