Explanations
words related to important or significant concepts or ideas
important concepts or significant elements within a text
Negative Logits
ews          -0.71
Tsukuyomi    -0.71
uthor        -0.68
asca         -0.67
AUT          -0.67
Bett         -0.67
Horses       -0.66
Mostly       -0.66
Chatt        -0.66
Fever        -0.65
Positive Logits
stone        1.19
stroke       1.19
stro         1.01
stones       0.99
*/(          0.90
binding      0.90
wcs          0.89
hole         0.89
ring         0.88
key          0.86
Activations Density: 0.020%