INDEX
Explanations
references to teaching and learning contexts
New Auto-Interp
Negative Logits
uki
-0.18
chop
-0.16
á»ĩn
-0.15
leton
-0.15
Leban
-0.14
ardi
-0.14
ç¶Ń
-0.14
iams
-0.14
TOT
-0.14
clare
-0.14
POSITIVE LOGITS
Noon
0.15
zl
0.15
bao
0.14
Muss
0.14
ppo
0.14
Baths
0.14
itas
0.14
/front
0.14
شد
0.14
ythe
0.14
Activations Density 0.777%