INDEX
Explanations
references to prominent individuals and their actions or statements
New Auto-Interp
Negative Logits
localidad
-0.37
réc
-0.36
térm
-0.35
économ
-0.34
зонта
-0.33
Waite
-0.32
Foreground
-0.32
Modern
-0.32
ladrillo
-0.32
Jerusalén
-0.32
POSITIVE LOGITS
webElement
0.75
PMailer
0.61
ligiloj
0.55
الرياضيه
0.52
RectangleBorder
0.51
BorderSide
0.50
期刊论文
0.50
masked
0.49
嘧
0.48
serpentine
0.47
Activations Density 0.052%