INDEX
Explanations
elements related to reading and knowledge acquisition
Negative Logits
Sinne            -0.41
stds             -0.39
itrile           -0.39
near             -0.39
ilich            -0.38
klare            -0.38
g                -0.36
onItemClick      -0.36
stücks           -0.36
personnalisés    -0.35
Positive Logits
heard            1.22
听说             1.16
apparently       1.05
heard            1.04
apparently       0.97
Apparently       0.96
Apparently       0.95
hears            0.95
Heard            0.95
Heard            0.94
Activations Density 0.334%