Explanations
phrases related to scheduled activities or events
Negative Logits
isclosed       -0.17
avir           -0.16
luž            -0.16
iza            -0.15
itur           -0.15
oug            -0.14
TestFixture    -0.14
yola           -0.14
lad            -0.14
εια            -0.14
Positive Logits
iker           0.16
ewolf          0.15
Nap            0.14
ahi            0.13
iaux           0.13
alink          0.13
andex          0.13
Nickel         0.13
اگ             0.13
ambit          0.13
Activations Density 0.034%