INDEX
Explanations
prepositions indicating a specific location or time
references to time and location in a narrative context
Negative Logits
DRAG        -0.54
Uzbek       -0.53
fut         -0.52
anto        -0.51
Cub         -0.51
pact        -0.50
iant        -0.49
yo          -0.49
STEM        -0.48
Billion     -0.47
Positive Logits
éĹĺ         0.64
itself      0.59
pers        0.58
ixel        0.57
notations   0.56
athi        0.54
hesion      0.54
mins        0.54
convey      0.53
ighting     0.53
Activations Density: 1.102%