INDEX
Explanations
phrases indicating the sharing or reporting of information and details
New Auto-Interp
Negative Logits
çŃ
-0.15
rai
-0.14
aggio
-0.14
icio
-0.14
DEX
-0.14
ouncements
-0.13
elda
-0.13
šek
-0.13
lico
-0.13
ivet
-0.13
POSITIVE LOGITS
935
0.15
uya
0.14
بÙĪØ±
0.14
ILA
0.14
baru
0.14
637
0.14
archives
0.13
rů
0.13
:');↵
0.13
947
0.13
Activations Density 0.099%