INDEX
Explanations
phrases that denote a concept or something specific
phrases indicating the concept of "past events or things."
New Auto-Interp
Negative Logits
Respons
-0.61
tails
-0.60
burse
-0.60
opus
-0.59
oons
-0.59
CPU
-0.59
oglu
-0.58
abad
-0.58
\-
-0.58
ped
-0.58
POSITIVE LOGITS
significance
0.90
importance
0.84
beauty
0.82
legend
0.80
substance
0.78
nightmares
0.75
note
0.75
ificantly
0.74
utmost
0.74
folklore
0.72
Activations Density 0.034%