INDEX
Explanations
occurrences of the word "half"
references to the concept of "half."
New Auto-Interp
Negative Logits
spor
-0.53
rul
-0.52
igslist
-0.51
ocr
-0.51
andr
-0.51
andi
-0.51
laun
-0.50
ollen
-0.49
verbs
-0.49
berus
-0.49
POSITIVE LOGITS
of
0.84
ousand
0.69
way
0.67
wheel
0.66
terness
0.66
çͰ
0.65
pipe
0.64
azo
0.63
OTAL
0.63
century
0.62
Activations Density 0.059%