INDEX
Explanations
phrases expressing uncertainty or speculation about future events
New Auto-Interp
Negative Logits
iente
-0.17
hta
-0.15
artz
-0.15
onde
-0.15
something
-0.14
thing
-0.14
Something
-0.14
unner
-0.14
ikan
-0.14
athers
-0.14
POSITIVE LOGITS
happen
0.40
happens
0.40
happened
0.38
Happ
0.32
happening
0.31
aconte
0.28
happ
0.28
trans
0.22
appen
0.22
HAPP
0.21
Activations Density 0.064%