INDEX
Explanations
discussions related to the flow of information, liquids, or energy
New Auto-Interp
Negative Logits
esquerdo
-0.45
inmobili
-0.44
shortName
-0.42
lendemain
-0.41
brittle
-0.39
finais
-0.39
lẻ
-0.38
Hohen
-0.38
Jurí
-0.38
femininos
-0.37
POSITIVE LOGITS
FLOW
1.17
STREAM
1.16
flow
1.13
Flow
1.13
flow
1.13
stream
1.09
Stream
1.04
流
1.01
Flows
1.01
Flow
1.00
Activations Density 0.333%