INDEX
Explanations
transitional words and phrases indicating contrast or emphasis
New Auto-Interp
Negative Logits
المناصب
-0.70
miniaturka
-0.56
الحياه
-0.56
IsMutable
-0.54
kasarigan
-0.53
المشاركات
-0.53
ivelany
-0.53
يتيمه
-0.52
rungsseite
-0.52
estekak
-0.52
POSITIVE LOGITS
also
0.39
SBATCH
0.37
tagext
0.35
podjela
0.34
photographed
0.34
mstyle
0.31
AssemblyCompany
0.31
heb
0.30
quiser
0.30
But
0.29
Activations Density 0.522%