INDEX
Explanations
concept, meaning, concert, reduce, within, earn, implies
New Auto-Interp
Negative Logits
BRANCH
0.43
ಅಂತ
0.42
HEAD
0.41
on
0.41
MATTER
0.41
N
0.40
sent
0.40
SENT
0.40
𝗙
0.40
𝙃
0.39
POSITIVE LOGITS
chaleur
0.49
essa
0.47
diminue
0.47
easier
0.46
minimize
0.46
bardzo
0.45
easement
0.45
maybe
0.44
esa
0.44
ebenfalls
0.44
Activations Density 0.011%