INDEX
Explanations
occurrences of time-related prepositions
Negative Logits
ube      -0.07
ahu      -0.07
ophon    -0.06
avras    -0.06
roph     -0.06
aring    -0.06
libft    -0.06
idf      -0.06
shade    -0.06
fit      -0.06
Positive Logits
mine     0.08
aky      0.07
ollow    0.06
uiten    0.06
IFO      0.06
ırak     0.06
eč       0.06
羽       0.06
ixon     0.06
Combat   0.06
Activations Density 0.008%