INDEX
Explanations
references to general items or concepts
Negative Logits
lágrimas    -0.84
gales       -0.84
hanem       -0.83
Heber       -0.82
UpInside    -0.82
larmes      -0.82
adecimal    -0.80
Sapphire    -0.78
วาด         -0.78
fhew        -0.78
Positive Logits
things      2.66
Things      2.31
Things      2.30
THINGS      2.22
thing       2.18
things      2.14
Thing       1.95
THING       1.85
Thing       1.78
THINGS      1.75
Activations Density 0.060%