Explanations
the concept or idea being discussed or mentioned in the text
references to concepts or ideas labeled as "notions."
Negative Logits
Token      Logit
avez       -0.63
Es         -0.61
annis      -0.60
gio        -0.57
acca       -0.57
pez        -0.56
conserv    -0.55
adra       -0.55
aqu        -0.55
opal       -0.55
Positive Logits
Token      Logit
ually      1.09
ally       1.05
rack       0.98
naire      0.92
ality      0.92
ively      0.90
edly       0.86
eers       0.86
notations  0.78
lessly     0.78
Activations Density: 0.031%
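
The density figure can be read as the share of sampled token positions on which the feature fires. The sketch below illustrates that reading only. It assumes "fires" means an activation strictly above zero, and the function name `activation_density`, the `threshold` parameter, and the toy activation list are all hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a feature counts as active when its activation is strictly
    greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Synthetic data, chosen only to illustrate the scale of the reported figure:
# 100,000 token positions, 31 of which have a nonzero activation.
acts = [0.0] * 100_000
for i in range(0, 31 * 3_000, 3_000):
    acts[i] = 1.7

density = activation_density(acts)
print(f"{density:.3%}")
```

At this density the feature is active on roughly 3 of every 10,000 positions, which suggests it is sparse and fairly specific.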