INDEX
Explanations
names and references to significant individuals, places, or events
New Auto-Interp
Negative Logits
uum
-0.34
uous
-0.34
uu
-0.34
uru
-0.34
u
-0.34
ucu
-0.33
unu
-0.33
ulus
-0.33
ucus
-0.33
uf
-0.32
POSITIVE LOGITS
klady
0.15
ạng
0.14
ảnh
0.13
dıģında
0.13
ặng
0.13
ắng
0.13
jezd
0.12
ẳng
0.12
ẳ
0.12
agon
0.12
Activations Density 0.579%