INDEX
Explanations
references to poetry and poetic forms
New Auto-Interp
Negative Logits
I
-0.52
A
-0.51
setBounds
-0.50
in
-0.49
b
-0.49
//
-0.49
estimés
-0.47
by
-0.47
S
-0.47
B
-0.46
POSITIVE LOGITS
poetry
1.92
poems
1.81
poem
1.77
poets
1.67
Poetry
1.67
Poem
1.64
Poetry
1.60
poetry
1.59
poet
1.58
Poems
1.49
Activations Density 0.154%