Explanations
phrases indicating assumptions or predictions
Negative Logits
Token           Logit
AndEndTag       -0.62
Diweddarwch     -0.57
صوتيه           -0.57
IContainer      -0.52
nahilalakip     -0.50
orithm          -0.47
cery            -0.44
égias           -0.44
चीज़ों           -0.43
considérons     -0.43
Positive Logits
Token           Logit
featureID       0.45
justamente      0.39
ioutil          0.36
GTCX            0.35
penup           0.35
المعيارى        0.34
MetaObject      0.33
hObject         0.33
właśnie         0.33
precisamente    0.33
Activations Density 0.031%