INDEX
Explanations
numerical values or counts related to various subjects
New Auto-Interp
Negative Logits
chevalier
-0.65
ССР
-0.60
itinéraire
-0.60
Ainda
-0.58
Nähe
-0.58
magie
-0.56
soldat
-0.55
ílica
-0.55
оригіналу
-0.54
combien
-0.54
POSITIVE LOGITS
three
0.82
four
0.81
two
0.79
different
0.79
major
0.78
other
0.74
distinct
0.74
zwei
0.73
mini
0.73
three
0.72
Activations Density 0.706%