INDEX
Explanations
words and names associated with a specific individual named "Bu."
Negative Logits
387          -0.17
_accessible  -0.16
iac          -0.15
igua         -0.15
gow          -0.15
inf          -0.15
infer        -0.14
infer        -0.14
ergus        -0.14
expectation  -0.14
Positive Logits
á»iji        0.17
apest        0.16
галтер       0.16
Aires        0.15
levard       0.15
rowspan      0.15
ilde         0.15
EMA          0.14
tones        0.14
рут          0.14
Activations Density 0.030%