INDEX
Explanations
instances of the word "significant" and its variations
New Auto-Interp
Negative Logits
ject
-0.20
erman
-0.19
bage
-0.16
cri
-0.15
ÑĩеÑĢ
-0.15
å¼ı
-0.15
ermann
-0.15
oom
-0.15
ãģĦãģĨ
-0.14
igue
-0.14
POSITIVE LOGITS
portions
0.23
portion
0.22
/sign
0.21
amount
0.20
amounts
0.20
ately
0.19
enough
0.19
;y
0.19
amount
0.19
ity
0.19
Activations Density 0.034%