INDEX
Explanations
sentences with an emphasis on abstract concepts or philosophical musings
concepts of philosophical depth or irony
New Auto-Interp
Negative Logits
dit
-0.79
byter
-0.75
idem
-0.75
tenance
-0.71
catentry
-0.70
BUS
-0.70
MpServer
-0.69
KO
-0.69
linger
-0.68
vice
-0.66
POSITIVE LOGITS
dwelling
0.72
tones
0.70
watching
0.69
Enlightenment
0.69
these
0.68
effic
0.67
THESE
0.66
knowing
0.66
detecting
0.65
adding
0.65
Activations Density 0.209%