INDEX
Explanations
english, Dek, Hammer, effect, oxygen
Negative Logits
उ          0.46
ureshi     0.45
intah      0.44
eches      0.44
agenda     0.44
ictus      0.44
akarta     0.43
Karachi    0.43
besar      0.43
attacks    0.42
Positive Logits
minim       0.48
συνεχ       0.48
&.          0.42
headroom    0.42
Continu     0.42
enthusiasm  0.41
เก          0.41
&-          0.41
originals   0.41
سطح         0.40
Activations Density 0.001%