INDEX
Explanations
phrases indicating a process or activity that has been ongoing or completed
expressions related to processes or situations that are ongoing or in development
Negative Logits
obal        -0.80
Flavoring   -0.67
ospons      -0.65
lav         -0.64
asar        -0.63
imar        -0.63
icipated    -0.61
amar        -0.59
ensions     -0.59
ush         -0.58
Positive Logits
psychiat        0.63
psychopath      0.60
FontSize        0.58
understatement  0.58
leash           0.58
announcer       0.57
applause        0.57
WHERE           0.56
bos             0.56
beh             0.55
Activations Density: 0.844%