INDEX
Explanations
phrases indicating time spent or frequency of activities
Negative Logits
undler    -0.16
ICENSE    -0.15
\grid     -0.14
Dalton    -0.14
Manor     -0.14
casting   -0.14
it        -0.13
raci      -0.13
anth      -0.13
xfb       -0.13
Positive Logits
ãĥªãĥ¼ãĤº  0.14
endi       0.14
ovich      0.14
roit       0.14
agi        0.14
çļĦåľ°     0.14
gı         0.13
ÃĤu        0.13
firefight  0.13
lace       0.13
Activations Density 0.040%