INDEX
Explanations
phrases indicating inevitability or the passage of time
New Auto-Interp
Negative Logits
.getValueAt
-0.17
ippets
-0.16
_VC
-0.15
onis
-0.15
-ie
-0.15
UNUSED
-0.15
Gilbert
-0.15
iktig
-0.15
립
-0.14
озем
-0.14
POSITIVE LOGITS
inevitable
0.36
inev
0.36
matter
0.35
matter
0.30
Matter
0.27
sooner
0.25
matters
0.23
logical
0.23
ine
0.23
natural
0.22
Activations Density 0.048%