Explanations
conclusive statements and resolutions in narratives
Negative Logits
alta        -0.15
aleb        -0.14
érc         -0.14
/tutorial   -0.14
roc         -0.14
اید         -0.14
orida       -0.14
uner        -0.13
istas       -0.13
áky         -0.13
Positive Logits
accompl          0.27
accomplish       0.27
accomplished     0.23
accordingly      0.20
sher             0.19
accomplishment   0.18
doing            0.18
그러             0.17
ogan             0.17
LENG             0.17
Activations Density 0.267%