INDEX
Explanations
numeric values with a specific format
counts or statistics relating to social media interactions
New Auto-Interp
Negative Logits
okin
-0.67
wagen
-0.67
andestine
-0.66
gow
-0.66
Iterator
-0.62
arios
-0.61
ornia
-0.60
refere
-0.59
Gardens
-0.59
pigeon
-0.58
POSITIVE LOGITS
/+
1.17
-+-+-+-+
1.01
/-
0.88
-+
0.83
--+
0.80
minus
0.80
=~=~
0.74
ï¸ı
0.72
(âĪĴ
0.71
Spoiler
0.71
Activations Density 0.106%