INDEX
Explanations
words related to animals and animal-related activities
New Auto-Interp
Negative Logits
ICT
-0.27
ICC
-0.23
IK
-0.21
ICC
-0.20
ICU
-0.19
Ich
-0.19
IC
-0.18
Cic
-0.18
CCI
-0.18
Ik
-0.17
POSITIVE LOGITS
িà¦
0.28
×Ļ×
0.23
ÃŃ
0.23
ï
0.22
İ
0.21
ı
0.21
ì
0.20
lc
0.20
ac
0.20
inc
0.20
Activations Density 0.126%