INDEX
Explanations
defining characteristics or attributes of people and entities
New Auto-Interp
Negative Logits
AndEndTag
-0.80
PreferredItem
-0.77
EndContext
-0.75
CloseOperation
-0.73
Попис
-0.73
ItemBackground
-0.71
kháu
-0.67
SharedDtor
-0.66
RectangleBorder
-0.66
فريبيس
-0.66
POSITIVE LOGITS
ⓘ
0.63
heureuse
0.56
holdet
0.55
dasarkan
0.52
montagna
0.52
KURZBESCHREIBUNG
0.51
contenus
0.50
<table>
0.49
httphttps
0.48
}")]
0.48
Activations Density 0.045%