Robust Principal Component Analysis

The classical Principal Component Analysis (PCA) is widely used for high-dimensional analysis and dimensionality reduction. Mathematically, if all the data points are stacked as column vectors of a (n, m)matrix $M$, PCA tries to decompose $M$ as

More …

Chinese Characters Decoding using BeastifulSoup

When scraping Chinese website using python (Windows 10 system) and requests, it usually returns “gb2312” code for Chinese characters. However, if one does not declare the encoding of requests, it uses its default encoding, which is not “gb2312”. The following example provides a solution to scrape Chinese contents and save them as Unicode encoding in a data file.

More …

Reduce the cost for home services by electricians, plumbers, etc.

There are many single homes in Vancouver, and the majority of the homes are more than ten years old. Problems of electricity and water happen often. However, the cost of hiring an electrician or plumber is high. I am thinking to find a method to reduce the cost by employing modern technologies. For example, remote online consulting through mobile phones, which could solve some simple problems with small time and money costs. Another example would be install monitoring sensor and transmit the real-time status of things at home, good for predicting (machine learning) as well as revealing problems.