INDEX
Explanations
the word "is" and various forms of the word "bis"
Negative Logits
featureID         -0.87
CloseOperation    -0.81
InputBorder       -0.81
препратки         -0.81
ExecuteAsync      -0.80
InteropServices   -0.79
Monfieur          -0.77
TagMode           -0.77
WithIOException   -0.74
اكتوبر            -0.74
Positive Logits
son     0.54
ii      0.48
ss      0.48
so      0.45
suis    0.45
iss     0.44
IS      0.43
sss     0.43
soni    0.43
imo     0.42
Activations Density 0.155%