INDEX
Explanations
numbers related to measurements or quantities
the number '6' in various contexts
New Auto-Interp
Negative Logits
tarn
-0.65
tain
-0.61
stand
-0.61
knowing
-0.60
pudding
-0.60
afore
-0.59
suff
-0.58
mimic
-0.58
stal
-0.58
secrecy
-0.58
POSITIVE LOGITS
6
3.14
7
2.44
5
2.43
8
2.37
9
2.23
4
2.20
3
1.99
2
1.82
1
1.66
0
1.62
Activations Density 0.029%