Explanations
numeric values indicating significant events or milestones
NEGATIVE LOGITS
Token         Logit
1             -0.45
2             -0.39
3             -0.34
mathrm        -0.33
4             -0.31
6             -0.30
synthesize    -0.30
5             -0.30
ifflin        -0.29
sense         -0.28
POSITIVE LOGITS
Token           Logit
Paglinawan      0.71
ſelf            0.66
betweenstory    0.64
вікісторінку    0.63
Italijanski     0.61
चीज़ों          0.60
ſch             0.60
kuiten          0.59
帖最后由        0.55
thousands       0.54
Activations Density 0.260%