INDEX
Explanations
concepts or ideas
references to the concept of "notion."
New Auto-Interp
Negative Logits
avez
-0.64
annis
-0.63
interrupted
-0.61
aqu
-0.60
Stard
-0.59
ãĤ
-0.58
onz
-0.57
acca
-0.57
onder
-0.57
contractor
-0.56
POSITIVE LOGITS
ually
1.23
ally
0.94
ality
0.89
ively
0.88
icity
0.86
rack
0.86
uality
0.85
eers
0.82
ologies
0.80
matic
0.79
Activations Density 0.032%