INDEX
Explanations
statements and questions that express opinions or arguments
a condition or implication
conditional conjunctions and punctuation
New Auto-Interp
Negative Logits
OGND
-0.70
насељу
-0.70
DoubleQuotes
-0.60
我也是
-0.59
saites
-0.58
위한
-0.57
()][
-0.56
ब्रेकडाउन
-0.55
betweenstory
-0.55
للمعارف
-0.53
POSITIVE LOGITS
if
1.34
If
1.14
when
1.09
If
1.04
hvis
0.98
если
0.95
如果
0.94
if
0.94
whenever
0.93
kapag
0.92
Activations Density 0.937%