INDEX
Explanations
intros or introductions within texts
instances of the word "intro" and related variations
Negative Logits
ware          -0.77
land          -0.70
parcels       -0.69
munitions     -0.69
endants       -0.68
Maritime      -0.68
wald          -0.66
Century       -0.66
mills         -0.65
UST           -0.65
Positive Logits
intro          3.65
Intro          2.80
introductory   1.36
introdu        1.22
Introduction   1.20
introduction   1.15
undergrad      1.07
opener         1.06
autobi         0.97
emot           0.95
Activations Density 0.023%