INDEX
Explanations
instances of the word "taken."
New Auto-Interp
Negative Logits
ë°©
-0.16
.='
-0.16
yped
-0.15
yk
-0.14
اÙħÛĮ
-0.14
ca
-0.14
è¹
-0.13
Ups
-0.13
____
-0.13
bum
-0.13
POSITIVE LOGITS
'gc
0.18
lili
0.17
oda
0.15
ä¹ħ
0.15
ãĥ³ãĥIJ
0.14
ãĤ°
0.14
_TP
0.14
.Xaml
0.14
æ¹ĸ
0.14
abi
0.13
Activations Density 0.015%