INDEX
Explanations
causal relationships and connections in scientific explanations
New Auto-Interp
Negative Logits
+#+#
-0.60
gatsby
-0.55
Италијани
-0.54
Chwiliwch
-0.54
خصة
-0.51
IMPORTED
-0.51
LookAnd
-0.51
صوتيه
-0.50
pinulongan
-0.47
لبة
-0.47
POSITIVE LOGITS
thereby
1.10
thus
1.04
hence
0.86
thus
0.82
从而
0.81
therefore
0.80
consequently
0.78
Thus
0.76
Thus
0.74
hence
0.73
Activations Density 0.556%