INDEX
Explanations
verbs indicating states of being and existence
New Auto-Interp
Negative Logits
Ñĺ
-0.16
allen
-0.15
hausen
-0.14
pcodes
-0.14
DONE
-0.14
andr
-0.14
::__
-0.14
posables
-0.13
ojÃŃ
-0.13
ãĤ¢ãĤ¤
-0.13
POSITIVE LOGITS
part
0.19
present
0.18
ornings
0.18
aska
0.16
supposed
0.15
marked
0.15
None
0.15
Pascal
0.15
ihan
0.15
garbage
0.15
Activations Density 0.194%