INDEX
Explanations
references to authors and their works in academic contexts
work followed by is/was/has
New Auto-Interp
Negative Logits
évaluateur
-0.49
ſch
-0.46
Monfieur
-0.46
Tikang
-0.42
ſta
-0.42
purpoſe
-0.41
+#+#
-0.41
Verſ
-0.40
المعيارى
-0.40
ſtand
-0.38
POSITIVE LOGITS
mycin
0.44
sor
0.44
properly
0.44
properly
0.44
roup
0.44
Personendaten
0.42
HAN
0.42
MLLoader
0.42
buru
0.42
Neutral
0.42
Activations Density 0.002%