I'm in the midst of validating a set of regression models that we
developed on a randomly selected two-thirds of a dataset. Our goal is to
come up with prediction equations that will best predict data from new
samples.

In the course of working on this, I'm starting to wonder about how
advisable it is to rely on a *single* division of our total sample into
base and validation samples for this purpose. In these days of
'computer-intensive' analyses, is there a reasonable way of trying out lots
of different divisions? For instance, would it be reasonable to generate
bootstrapped sampling distributions of the various parameter estimates and
then use the medians of those distributions when predicting for new samples?
(If so, is that still reasonable if the predictors are not orthogonal to
one another?) Or is this sort of thing not worth the bother?
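
To make the question concrete, here is a minimal sketch in Python (NumPy) of the two things I have in mind: refitting over many random base/validation divisions, and bootstrapping the coefficients and taking their medians. The data are simulated with two deliberately correlated predictors, and the function names (`fit_ols`, `repeated_split_rmse`, `bootstrap_coefs`), the number of repetitions, and the use of RMSE as the validation criterion are all assumptions made for illustration, not our actual analysis.

```python
import numpy as np


def fit_ols(X, y):
    """Ordinary least-squares coefficients, intercept first."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict(coef, X):
    """Predicted values from an intercept-first coefficient vector."""
    return coef[0] + X @ coef[1:]


def repeated_split_rmse(X, y, n_splits=200, train_frac=2 / 3, seed=0):
    """Validation RMSE over many random base/validation divisions."""
    rng = np.random.default_rng(seed)
    n = len(y)
    n_train = int(round(train_frac * n))
    rmse = np.empty(n_splits)
    for i in range(n_splits):
        perm = rng.permutation(n)
        base, valid = perm[:n_train], perm[n_train:]
        coef = fit_ols(X[base], y[base])
        resid = y[valid] - predict(coef, X[valid])
        rmse[i] = np.sqrt(np.mean(resid ** 2))
    return rmse


def bootstrap_coefs(X, y, n_boot=500, seed=0):
    """Coefficients refitted on cases resampled with replacement."""
    rng = np.random.default_rng(seed)
    n = len(y)
    coefs = np.empty((n_boot, X.shape[1] + 1))
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample whole cases
        coefs[b] = fit_ols(X[idx], y[idx])
    return coefs


# Simulated data (hypothetical): x2 is correlated with x1,
# so the predictors are not orthogonal.
rng = np.random.default_rng(42)
n = 300
x1 = rng.normal(size=n)
x2 = 0.7 * x1 + rng.normal(scale=0.5, size=n)
X = np.column_stack([x1, x2])
y = 1.0 + 2.0 * x1 - 1.0 * x2 + rng.normal(size=n)

rmse = repeated_split_rmse(X, y)
coefs = bootstrap_coefs(X, y)
median_coef = np.median(coefs, axis=0)

print("validation RMSE: mean %.3f, sd %.3f" % (rmse.mean(), rmse.std(ddof=1)))
print("bootstrap median coefficients:", np.round(median_coef, 3))
```

The spread of `rmse` across divisions is what a single split cannot show. Note that the medians in `median_coef` are taken one coefficient at a time, which is exactly where my worry about non-orthogonal predictors comes in.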

