INDEX
Explanations
phrases that contain multiple uses of the word "in" within a context of analysis or description
New Auto-Interp
Negative Logits
abay
-0.16
_exchange
-0.15
fact
-0.15
Exchange
-0.15
FACT
-0.15
cie
-0.14
ambi
-0.14
åĢ
-0.14
spite
-0.13
fact
-0.13
POSITIVE LOGITS
detail
0.61
detail
0.49
depth
0.47
Detail
0.43
greater
0.42
-detail
0.40
Detail
0.39
great
0.36
depth
0.36
detalle
0.35
Activations Density 0.160%