INDEX
Explanations
phrases that express causal relationships or consequences in arguments
Negative Logits
متعلقه    -0.85
Grâce    -0.83
GEBURTSDATUM    -0.82
yntaxException    -0.82
الحياه    -0.81
للاسماء    -0.79
propOrder    -0.79
autorytatywna    -0.78
featureID    -0.77
صوتيه    -0.72
Positive Logits
Certainly    1.04
Certainly    1.00
certainly    0.90
certainly    0.86
certamente    0.80
Indeed    0.80
even    0.80
Indeed    0.76
Presumably    0.74
indeed    0.71
Activations Density: 1.271%