Explanations
concerns related to the integration of disabled students into traditional educational settings
Negative Logits
Token          Logit
fun            -0.35
sätzlich       -0.34
occasionally   -0.28
obstante       -0.26
tea            -0.25
once           -0.25
fellow         -0.25
pretty         -0.23
help           -0.23
em             -0.22
Positive Logits
Token          Logit
<pad>          0.85
<unused43>     0.85
<unused74>     0.85
<unused80>     0.85
<unused42>     0.85
<unused41>     0.85
<unused76>     0.85
dieſe          0.85
<unused23>     0.85
<unused8>      0.85
Activation Density: 0.488%