INDEX
Explanations
articles and determiners used in conjunction with nouns
New Auto-Interp
Negative Logits
fevere
-0.72
Chriftian
-0.68
Monfieur
-0.65
houſe
-0.64
Danemark
-0.64
IVEREF
-0.63
intest
-0.62
apapun
-0.62
يتيمه
-0.61
Chrif
-0.60
POSITIVE LOGITS
awtextra
0.82
مشين
0.82
which
0.67
cual
0.62
once
0.59
which
0.58
وهي
0.57
mere
0.57
すな
0.56
Mere
0.56
Activations Density 0.107%