INDEX
Explanations
specific sequences of letters that correspond to names or identifiers
words starting with W
New Auto-Interp
Negative Logits
qrst
-0.51
TemporalType
-0.50
tagHelper
-0.46
anyeol
-0.44
permitAll
-0.44
Kild
-0.42
raí
-0.41
idiv
-0.41
.*")]
-0.40
MENAFN
-0.40
POSITIVE LOGITS
wak
0.69
ww
0.69
wo
0.68
wat
0.67
waf
0.66
wed
0.66
w
0.65
שוליים
0.65
wc
0.64
wy
0.63
Activations Density 0.056%