INDEX
Explanations
references to the term 'IB' and related variations in different contexts
New Auto-Interp
Negative Logits
igner
-0.19
egasus
-0.16
angen
-0.16
-0.16
Sabb
-0.16
ần
-0.15
ượng
-0.15
zure
-0.15
onas
-0.14
onse
-0.14
POSITIVE LOGITS
upro
0.31
rahim
0.23
clc
0.20
érica
0.19
ib
0.19
(ib
0.19
IB
0.18
Times
0.17
iza
0.17
_ib
0.17
Activations Density 0.006%