INDEX
Explanations
phrases indicating innovative ideas or creative thoughts
New Auto-Interp
Negative Logits
UNK
-0.17
stanov
-0.15
unk
-0.14
iens
-0.14
ibal
-0.14
_sdk
-0.14
rlen
-0.14
Äįka
-0.14
okud
-0.14
ogy
-0.14
POSITIVE LOGITS
idea
0.96
ideas
0.86
idea
0.82
Idea
0.81
Ideas
0.73
ideas
0.73
IDEA
0.52
concept
0.48
.idea
0.48
concepts
0.44
Activations Density 0.378%