Motivation Despite the popularity and power of diffusion models, their theoretical formulations are not unified. Because multiple groups have derived these models from different backgrounds, there exist multiple formulations: SDE, ODE, Markov chain, non-Markov chain, etc.
HiPPO: Recurrent Memory with Optimal Polynomial Projections Motivation The hidden state in an RNN represents a form of memory of the past. For a sequence, a natural way to represent the past is to project it onto an orthonormal basis set. Depending on how much emphasis we place on different parts of the past, we can define different measures on the time axis and build the basis set with respect to that measure. We can then keep track of the projection coefficients on this basis as new data points are observed.
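As a minimal sketch of this idea in formulas (the notation $c_k(t)$, $g_k$, $\mu^{(t)}$ is mine, not from the post): the history $f_{\le t}$ is summarized by its coefficients on orthogonal polynomials $g_k$ under a time-dependent measure $\mu^{(t)}$, and in the HiPPO framework these coefficients evolve by a linear ODE that can be updated online as new points arrive.

$$
c_k(t) = \int f(s)\, g_k(s)\, \mathrm{d}\mu^{(t)}(s), \qquad
\frac{\mathrm{d}}{\mathrm{d}t}\, c(t) = A(t)\, c(t) + B(t)\, f(t).
$$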
Motivation The S4 sequence model is rising in the sequence-modelling field: it dominates RNNs, LSTMs, and Transformers on long-sequence modelling. It is both mathematically elegant and useful, and it is trending, so why not write about it.
Motivation Consider a distribution $p(x)$; we can “convolve” it with a kernel $p(\tilde{x}\mid x)=q(\tilde{x}-x)$. The marginal distribution of $\tilde{x}$ is denoted $p_\sigma(\tilde{x})$. We want to model the score of this convolved distribution, $\nabla\log p_\sigma(\tilde{x})$, and relate it to the score of the original distribution.
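Spelled out, the setup above is just a convolution of $p$ with $q$ (this display only restates the definitions already given):

$$
p_\sigma(\tilde{x}) = \int q(\tilde{x}-x)\, p(x)\, \mathrm{d}x, \qquad \text{and we model } \nabla_{\tilde{x}} \log p_\sigma(\tilde{x}).
$$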
Motivation How can we understand the EM algorithm from a theoretical perspective? This post views EM as a form of alternating ascent on a lower bound of the likelihood. The Key Trick of EM The key trick to remember is the use of Jensen's inequality on the logarithm: we can swap expectation and logarithm to obtain a lower bound on the likelihood. Generally, given a positive function $q(z)$ that sums to $1$ (a probability density), we have the following inequality.
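The excerpt stops before the inequality itself; the standard Jensen bound it is describing takes the following form (writing $p(x,z)$ for the joint likelihood over data $x$ and latent $z$, which is my assumed notation):

$$
\log p(x) = \log \int q(z)\, \frac{p(x,z)}{q(z)}\, \mathrm{d}z \;\ge\; \int q(z)\, \log \frac{p(x,z)}{q(z)}\, \mathrm{d}z,
$$

where the inequality is Jensen's inequality applied to the concave logarithm, i.e. the expectation and the logarithm are swapped.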
Motivation Recently, a line of research in generative image models, diffusion models, emerged and showed performance competitive with GANs [^1]. More recently, larger-scale versions gave rise to the groundbreaking model DALL-E 2 and its precursor GLIDE.
Motivation How do we compute the determinant or inverse of a matrix with a low-rank modification? This is a very interesting and important technique in statistical methods, since people frequently model a covariance matrix or connectivity matrix in exactly this way: a simple matrix plus a low-rank modification.
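For reference, the two standard identities this question points to are the matrix determinant lemma and the Woodbury identity (whether the post states them in exactly this form is my assumption):

$$
\det(A + UV^\top) = \det(A)\,\det(I + V^\top A^{-1} U),
$$

$$
(A + UCV)^{-1} = A^{-1} - A^{-1}U\left(C^{-1} + V A^{-1} U\right)^{-1} V A^{-1}.
$$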
Motivation Here we summarize a few common probabilistic neural population models, adapted from reading notes and class presentations for Neuro QC316 taught by Jan Drugowitsch. LNP, GLM These are the simplest models of neurons.
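As a one-line sketch of what these acronyms mean (standard textbook form; the stimulus filter $\mathbf{k}$, nonlinearity $f$, history filter $\mathbf{h}$, and baseline $b$ are generic notation, not taken from the post): an LNP neuron filters the stimulus, passes it through a nonlinearity, and spikes as a Poisson process, while a GLM additionally feeds back the spike history.

$$
y_t \sim \mathrm{Poisson}\!\left(f(\mathbf{k}^\top \mathbf{x}_t)\right)
\quad\text{(LNP)}, \qquad
y_t \sim \mathrm{Poisson}\!\left(f(\mathbf{k}^\top \mathbf{x}_t + \mathbf{h}^\top \mathbf{y}_{<t} + b)\right)
\quad\text{(GLM)}.
$$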
Motivation Simply put, the “kernel trick” is the observation that in some algorithms only inner products appear in the formulation. Because of this, we can substitute the inner product with a fancier kernel function, i.e. an inner product in some other space. This post is about another usage of the kernel trick: kernel (ridge) regression.
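A minimal numpy sketch of kernel ridge regression (my own illustration, not code from the post; the RBF kernel, regularizer $\lambda$, and function names are assumptions): fit by solving $(K + \lambda I)\alpha = y$, then predict with $\hat{f}(x) = \sum_i \alpha_i k(x, x_i)$, so only kernel evaluations ever appear.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Gram matrix of the RBF kernel k(a, b) = exp(-gamma * ||a - b||^2)
    sq_dists = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * sq_dists)

def fit_kernel_ridge(X, y, lam=1e-2, gamma=1.0):
    # Dual solution: (K + lam * I) alpha = y
    K = rbf_kernel(X, X, gamma)
    return np.linalg.solve(K + lam * np.eye(len(X)), y)

def predict_kernel_ridge(X_train, alpha, X_new, gamma=1.0):
    # Prediction uses only inner products in feature space, i.e. kernel values
    return rbf_kernel(X_new, X_train, gamma) @ alpha

# Usage: regress a noisy sine wave
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(50, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(50)
alpha = fit_kernel_ridge(X, y, lam=1e-2, gamma=0.5)
y_hat = predict_kernel_ridge(X, alpha, X, gamma=0.5)
```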
Motivation As mentioned in our high-dimensional PCA note, understanding the spectrum of Toeplitz matrices is important. The subject itself is a bit technical, but the analytical techniques involved are splendid and general. So here I take notes from this paper and present a way to calculate the spectrum on paper (or with Mathematica).