INDEX
Explanations
future tense verbs and expressions indicating ongoing actions or states
Negative Logits
Token     Value
226       -0.16
this      -0.15
zer       -0.15
cano      -0.14
éĤ£æł·    -0.14
Äij       -0.14
Nob       -0.14
bug       -0.14
alike     -0.14
mond      -0.14
Positive Logits
Token      Value
happening  0.22
happen     0.21
happens    0.20
happened   0.18
ellas      0.17
done       0.16
stuff      0.16
agrams     0.16
agram      0.16
done       0.16
Activations Density 0.234%
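
Two entries in the negative logits list, `éĤ£æł·` and `Äij`, look like raw byte-level BPE token strings rather than readable text. The sketch below assumes the underlying tokenizer uses the GPT-2 style byte-to-unicode table, which this page does not confirm; `decode_token` is a hypothetical helper name. Under that assumption, it maps each displayed character back to its byte and decodes the bytes as UTF-8.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table.

    Assumption: the tokenizer behind this page uses this table.
    """
    # Bytes that are already printable map to themselves.
    bs = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


def decode_token(token):
    """Turn a displayed token string back into readable text (hypothetical helper)."""
    inverse = {ch: b for b, ch in bytes_to_unicode().items()}
    raw = bytes(inverse[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


# The two opaque tokens, written with escapes to avoid copy/paste ambiguity.
print(decode_token("\u00e9\u0124\u00a3\u00e6\u0142\u00b7"))  # 那样
print(decode_token("\u00c4\u0133"))                          # đ
```

Plain ASCII tokens such as `happen` pass through unchanged. A character outside the table raises `KeyError`, which would suggest the tokenizer is not a byte-level one.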