INDEX
Explanations
references to various types of candidates or subjects in discussions
New Auto-Interp
Negative Logits
ÑĦа
-0.14
isman
-0.14
kaç
-0.14
μη
-0.14
äºĨä¸Ģ
-0.14
lj
-0.14
lagi
-0.13
uce
-0.13
tle
-0.13
ÙħÙĪØ¯
-0.13
POSITIVE LOGITS
given
0.41
given
0.35
particular
0.34
GIVEN
0.29
Given
0.28
Given
0.26
_given
0.26
PARTICULAR
0.25
person
0.21
particul
0.21
Activations Density 0.275%