INDEX
Explanations
specific nouns and terms related to activities and interactions
New Auto-Interp
Negative Logits
kasarigan
-0.64
Geplaatst
-0.58
-0.53
للاسماء
-0.48
surla
-0.47
Савезне
-0.45
ThemeOverlay
-0.43
الرياضيه
-0.43
*);
-0.43
wapV
-0.42
POSITIVE LOGITS
們
1.19
们
1.16
sthe
1.00
们的
0.88
ها
0.86
es
0.83
ss
0.83
들은
0.81
́s
0.81
ssss
0.77
Activations Density 3.097%