INDEX
Explanations
adjectives describing a lack of awareness or experience
occurrences of the word "naive" and its variations in contexts questioning judgment and innocence
New Auto-Interp
Negative Logits
ŃĶ
-0.80
Downloadha
-0.77
alach
-0.76
srf
-0.74
interrupted
-0.73
contiguous
-0.72
hops
-0.71
ngth
-0.71
avez
-0.70
foreseen
-0.69
POSITIVE LOGITS
te
0.99
naive
0.98
lings
0.92
sters
0.86
tle
0.84
ster
0.82
ted
0.82
innocence
0.82
ïve
0.81
naïve
0.81
Activations Density 0.013%