INDEX
Explanations
numbers and numerical values
numerical data and estimates typically associated with statistics or predictions
New Auto-Interp
Negative Logits
beauty
-0.68
bombard
-0.65
roses
-0.62
liberating
-0.62
kit
-0.62
swim
-0.61
beaut
-0.60
stre
-0.60
bunny
-0.59
dressing
-0.59
POSITIVE LOGITS
5
1.32
75
1.23
8
1.19
6
1.19
7
1.16
50
1.15
25
1.14
0
1.14
06
1.14
9
1.11
Activations Density 0.094%