INDEX
Explanations
phrases related to comparisons and contrasts
instances of the word "what" in relation to various topics or concepts
New Auto-Interp
Negative Logits
enburg
-0.74
ster
-0.64
jee
-0.64
por
-0.62
enberg
-0.56
gur
-0.56
ji
-0.56
lich
-0.55
caveat
-0.55
largeDownload
-0.54
POSITIVE LOGITS
happens
1.33
soever
1.32
happened
1.31
transpired
1.17
constitutes
1.12
else
0.99
happ
0.93
constituted
0.89
separates
0.87
occurs
0.81
Activations Density 0.109%