INDEX
Explanations
proper nouns related to individuals or entities
Negative Logits
alette          -0.07
ayah            -0.07
BERT            -0.07
ùa              -0.07
onDataChange    -0.07
roupon          -0.07
вик             -0.07
غان             -0.07
ække            -0.06
aldo            -0.06
Positive Logits
velt            0.09
his             0.07
inside          0.06
vek             0.06
Savannah        0.06
hay             0.06
se              0.06
omba            0.05
loy             0.05
rel             0.05
Activations Density: 0.001%