INDEX
Explanations
strings with the word "it" followed by a number
occurrences of the word "it."
New Auto-Interp
Negative Logits
schooling
-0.71
hips
-0.69
destro
-0.68
«ĺ
-0.66
dise
-0.63
prevail
-0.63
¥ŀ
-0.62
prevailed
-0.60
parting
-0.60
therap
-0.59
POSITIVE LOGITS
anium
1.06
amins
1.01
herer
1.01
unes
0.98
self
0.97
chell
0.94
ople
0.92
rogen
0.92
atis
0.91
hers
0.90
Activations Density 0.034%