INDEX
Explanations
specific quotations or phrases related to concepts and rules
Negative Logits
réfrig        -0.61
meurt         -0.58
culturelles   -0.53
croit         -0.51
financières   -0.51
formales      -0.50
δες           -0.50
ouvriers      -0.50
sebaik        -0.49
promouvoir    -0.48
Positive Logits
uxxxx         1.02
              0.89
NSCoder       0.72
typelib       0.69
(?)           0.68
viewType      0.65
(?)           0.64
GEBURTSDATUM  0.63
contentLoaded 0.61
:✨            0.61
Activations Density: 0.164%