INDEX
Explanations
references to periods or durations of time
New Auto-Interp
Negative Logits
eson
-0.17
ayan
-0.17
arrants
-0.15
ýt
-0.15
-Fi
-0.15
ared
-0.14
wards
-0.14
ahir
-0.14
agers
-0.14
ards
-0.14
POSITIVE LOGITS
icals
0.44
ical
0.38
ont
0.29
ontology
0.25
ically
0.23
icity
0.23
ICAL
0.21
icial
0.19
زÙħاÙĨÛĮ
0.18
ict
0.17
Activations Density 0.040%