INDEX
Explanations
references to educational institutions and their transformations
New Auto-Interp
Negative Logits
akadem
-0.15
idor
-0.14
alla
-0.14
ÙĪÛĮÙĦ
-0.14
adoo
-0.14
ocre
-0.14
Hague
-0.14
extView
-0.14
ode
-0.13
Forge
-0.13
POSITIVE LOGITS
threshold
0.30
Threshold
0.27
higher
0.25
thresholds
0.24
threshold
0.23
disciplinary
0.22
Threshold
0.21
liberal
0.21
Liberal
0.21
Higher
0.20
Activations Density 0.006%