INDEX
Explanations
phrases indicating specific occurrences or referents in context
New Auto-Interp
Negative Logits
more
-0.54
tamment
-0.53
antaranya
-0.50
sonst
-0.50
ninh
-0.49
ampingi
-0.49
êmio
-0.48
sonian
-0.48
Contactez
-0.47
rarement
-0.46
POSITIVE LOGITS
この
0.70
這一
0.69
este
0.68
ఈ
0.67
kasarigan
0.67
ఈ
0.67
この
0.66
ഈ
0.66
Этот
0.65
kind
0.65
Activations Density 0.137%