INDEX
Explanations
words related to data or information
the presence of the word "ont" in various contexts
New Auto-Interp
Negative Logits
BILITIES
-0.82
loo
-0.76
GBT
-0.72
ulative
-0.71
tips
-0.71
GY
-0.70
glers
-0.68
DOM
-0.67
FFER
-0.67
MU
-0.67
POSITIVE LOGITS
gomery
1.01
osaurus
0.97
ario
0.95
rol
0.93
ological
0.92
ainment
0.90
ology
0.88
aire
0.87
ract
0.86
ribut
0.84
Activations Density 0.017%