INDEX
Explanations
references to the Irish character or identity
Negative Logits
dana        -0.16
зÑĮ         -0.15
ilded       -0.15
extr        -0.15
ows         -0.15
egral       -0.15
oldown      -0.15
atar        -0.15
PyObject    -0.14
enin        -0.14
Positive Logits
Ir          0.24
regular     0.24
ir          0.23
irr         0.22
replace     0.20
Irr         0.19
(IR         0.18
land        0.17
(ir         0.17
ving        0.17
Activations Density: 0.021%
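
As a rough illustration of how a density figure like this could be computed, the sketch below assumes that "activations density" means the fraction of token positions on which the feature fires (activation above zero). That definition, the function name `activation_density`, the `threshold` parameter, and the toy data are all assumptions made for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density is "share of tokens on which the feature fires".
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy data (hypothetical): the feature fires on 2 of 10,000 token positions.
toy_activations = [0.0] * 9998 + [1.7, 0.4]

density = activation_density(toy_activations)
print(f"Activations Density: {density:.3%}")  # prints "Activations Density: 0.020%"
```

Under this reading, a density of 0.021% would correspond to the feature firing on roughly 2 of every 10,000 tokens in the sampled text.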