INDEX
Explanations
instances of high activation patterns in data analysis
scientific and foreign terms
New Auto-Interp
Negative Logits
betweenstory
-0.63
DockStyle
-0.60
defStyle
-0.57
Personendaten
-0.56
+#+
-0.51
[*]
-0.49
VersionUID
-0.47
للاسماء
-0.46
nakalista
-0.43
defStyleAttr
-0.43
POSITIVE LOGITS
türlü
0.51
caseros
0.49
nahilalakip
0.49
domestiques
0.47
তথ্যসূত্র
0.46
disfraz
0.46
Schwer
0.45
scientifiques
0.44
vraie
0.44
scientifique
0.44
Activations Density 0.481%