INDEX
Explanations
occurrences of the word "contact."
New Auto-Interp
Negative Logits
bery
-0.18
ISTIC
-0.15
acock
-0.15
sters
-0.15
unos
-0.14
oted
-0.14
á»Ļng
-0.14
δια
-0.14
/null
-0.14
istr
-0.14
POSITIVE LOGITS
ually
0.32
ual
0.30
lenses
0.27
ors
0.25
ivity
0.25
UAL
0.24
lens
0.23
Lens
0.20
ible
0.19
less
0.19
Activations Density 0.033%