INDEX
Explanations
assertions of opinion or statements about facts
Negative Logits
prefixer      -0.61
others        -0.60
Baillargeon   -0.57
THERE         -0.56
there         -0.56
đó            -0.53
นั้น            -0.53
istnieje      -0.53
this          -0.52
đây           -0.52
Positive Logits
why           1.20
because       0.94
where         0.93
how           0.92
true          0.80
porque        0.74
perché        0.74
despite       0.73
mybatisplus   0.71
assuming      0.70
Activations Density: 0.292%