This question can be broken down into several use cases.
1. Learning the syntax of linear algebra and its relationship to matrix theory. Gotta get used to symbols, notation and jargon ASAP.
2. Some numerical linear algebra.
3. Stuff like gradient, Jacobian, Hessian
4. Some background on optimisation
5. numpy and scipy have buckets of stuff. You learn a lot by applying these libraries to practical cases. The Schaum books are great for input examples.
In general, I don't think that a book LA and ML exists.
(most books on ML fall short big time, unfortunately.. it is quite depressing to be honest)
Which books on 1-5 depends on .. but a first shot is
Shilov, G.E. (1977) Linear Algebra Dover.
Dahlquist, G. and Björck, A. (1974)
Numerical Methods. Dover
Kreider, D.L., Kuller, R.G., Ostberg, D.R. and Perkins, F.W. (1966)
An Introduction to Linear Analysis Addison-Wesley.
Nocedal, J. and Wright, S. (2006) Numerical Optimisation Springer. (bit more advanced)
// I am finishing the manuscript of my new PDE/FDM book and I have about 4-5 chapters on these topics.
I treat these and much on
www.datasim.nl online courses
// bit of name dropping: Gil Strang is my "academic PhD grandfather".