INDEX
Explanations
mentions of different variants
occurrences of the word "variant" and its related forms
New Auto-Interp
Negative Logits
ħĭ
-0.80
yer
-0.76
Ķ
-0.73
usalem
-0.72
olulu
-0.71
yers
-0.71
itness
-0.70
cept
-0.68
ashington
-0.67
mberg
-0.66
POSITIVE LOGITS
variants
1.30
variant
1.24
allele
0.94
variations
0.87
versions
0.85
surn
0.83
vari
0.78
costumes
0.78
variation
0.77
ulously
0.75
Activations Density 0.007%