INDEX
Explanations
various forms of comparison and resemblance in descriptions
New Auto-Interp
Negative Logits
httphttps
-0.61
fazer
-0.50
keduanya
-0.49
critères
-0.48
casada
-0.48
bezpiecze
-0.47
Herrn
-0.47
Geräten
-0.46
bestaan
-0.45
którzy
-0.45
POSITIVE LOGITS
akin
0.50
Савезне
0.46
like
0.46
a
0.44
pseudo
0.42
bl
0.42
ers
0.42
jud
0.41
quasi
0.41
modern
0.41
Activations Density 0.452%