INDEX
Explanations
proper nouns and names of people or places
proper nouns and names
Negative Logits
698         -0.81
662         -0.78
Marian      -0.77
confir      -0.75
Sar         -0.75
658         -0.74
val         -0.74
660         -0.73
Colombia    -0.73
Camel       -0.72
Positive Logits
ick         1.69
icks        1.43
ICK         1.40
ck          1.27
rick        1.24
ack         1.18
ickers      1.10
Rick        1.09
ricks       1.08
acker       1.08
Activations Density 0.303%