Intelligent Computing System combines deep learning, parallel programming, computer organization, and computer architecture.

Neural Network Basis

Loss function $L(w)=\frac{1}{2}\sum_i(H(x_i)-y_i)^2=\frac{1}{2}\sum_i(w^Tx_i-y_i)^2$

Gradient Descent: $w=w-\alpha\frac{\partial L(w)}{\partial w}$
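The update rule can be tried on a toy problem. The following is a minimal sketch in pure Python, assuming the linear hypothesis $H(x)=w^Tx$ and the squared-error loss; the data set, the learning rate `alpha=0.05`, and the step count are illustrative assumptions, not values from these notes.

```python
def dot(a, b):
    """Inner product w^T x of two equal-length lists."""
    return sum(ai * bi for ai, bi in zip(a, b))


def loss(w, xs, ys):
    """L(w) = 1/2 * sum_i (w^T x_i - y_i)^2."""
    return 0.5 * sum((dot(w, x) - y) ** 2 for x, y in zip(xs, ys))


def gradient(w, xs, ys):
    """dL/dw = sum_i (w^T x_i - y_i) * x_i."""
    grad = [0.0] * len(w)
    for x, y in zip(xs, ys):
        err = dot(w, x) - y
        for j, xj in enumerate(x):
            grad[j] += err * xj
    return grad


def gradient_descent(xs, ys, alpha=0.05, steps=2000):
    """Repeat w = w - alpha * dL/dw, starting from w = 0."""
    w = [0.0] * len(xs[0])
    for _ in range(steps):
        g = gradient(w, xs, ys)
        w = [wj - alpha * gj for wj, gj in zip(w, g)]
    return w


# Toy data generated from y = 2*x + 1; the constant 1.0 feature acts as the bias.
xs = [[0.0, 1.0], [1.0, 1.0], [2.0, 1.0], [3.0, 1.0]]
ys = [1.0, 3.0, 5.0, 7.0]
w_fit = gradient_descent(xs, ys)
print(w_fit, loss(w_fit, xs, ys))
```

With this data the fitted weights approach slope 2 and bias 1. A learning rate that is too large makes the iteration diverge, so `alpha` has to be tuned per problem.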

Activation Function

Back Propagation: Chain Rule $\frac{\partial L}{\partial w}=\frac{\partial L}{\partial y}\frac{\partial y}{\partial z}\frac{\partial z}{\partial w}$
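A worked example of the chain rule for a single neuron, as a sketch under stated assumptions: $z=wx$, $y=\sigma(z)$ with a sigmoid activation, and $L=\frac{1}{2}(y-t)^2$ with target $t$. The sigmoid, the target symbol `t`, and the function names are choices made for illustration. The analytic gradient is checked against a finite-difference estimate.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(w, x, t):
    """Forward pass: z = w*x, y = sigmoid(z), L = 1/2 (y - t)^2."""
    z = w * x
    y = sigmoid(z)
    L = 0.5 * (y - t) ** 2
    return z, y, L


def backward(w, x, t):
    """dL/dw as the product of the three local derivatives."""
    _, y, _ = forward(w, x, t)
    dL_dy = y - t            # derivative of 1/2 (y - t)^2 w.r.t. y
    dy_dz = y * (1.0 - y)    # derivative of the sigmoid w.r.t. z
    dz_dw = x                # derivative of w*x w.r.t. w
    return dL_dy * dy_dz * dz_dw


def numeric_grad(w, x, t, eps=1e-6):
    """Central finite difference, used only to check the analytic result."""
    return (forward(w + eps, x, t)[2] - forward(w - eps, x, t)[2]) / (2 * eps)


print(backward(0.5, 2.0, 1.0), numeric_grad(0.5, 2.0, 1.0))
```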

Neural network structure: input layer, hidden layer(s), output layer

CNN

convolution layer

pooling
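The two layer types can be sketched on nested lists. This is a minimal illustration assuming a single channel, stride-1 "valid" convolution (implemented as cross-correlation, as deep learning frameworks usually do), and non-overlapping max pooling; the function names `conv2d` and `max_pool2d` are hypothetical.

```python
def conv2d(image, kernel):
    """Valid cross-correlation with stride 1 and no padding."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)]
            for i in range(out_h)]


def max_pool2d(fmap, size=2):
    """Non-overlapping max pooling: window and stride are both `size`."""
    return [[max(fmap[i + a][j + b]
                 for a in range(size) for b in range(size))
             for j in range(0, len(fmap[0]) - size + 1, size)]
            for i in range(0, len(fmap) - size + 1, size)]


image = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]
kernel = [[1, 0],
          [0, -1]]  # responds to change along the main diagonal

print(conv2d(image, kernel))   # 3x3 feature map
print(max_pool2d(image))       # 2x2 pooled map
```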

fully connected layer + softmax $f(z_j)=\frac{e^{z_j}}{\sum_ie^{z_i}}$
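A small sketch of the softmax formula in pure Python. Subtracting the maximum before exponentiating is a common numerical-stability trick that is added here as an assumption; it does not change the result because the factor cancels in the ratio.

```python
import math


def softmax(z):
    """f(z_j) = exp(z_j) / sum_i exp(z_i), computed in a stable way."""
    m = max(z)                                # shift so the largest exponent is 0
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


probs = softmax([1.0, 2.0, 3.0])
print(probs, sum(probs))
```

The outputs are positive and sum to 1, so they can be read as class probabilities.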

e.g. AlexNet, VGG, Inception, ResNet

How to evaluate a CNN?

IoU (intersection over union), aka Jaccard index
$IoU=\frac{|A\cap B|}{|A\cup B|}$
If IoU > 0.5, the predicted location is accepted.
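For axis-aligned bounding boxes the ratio reduces to a few lines of arithmetic. The sketch below assumes boxes are given as `(x1, y1, x2, y2)` corner tuples with `x1 < x2` and `y1 < y2`; that layout and the helper name `location_accepted` are illustrative assumptions.

```python
def iou(box_a, box_b):
    """Intersection area divided by union area of two boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap, clamped to 0 when boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def location_accepted(pred, truth, threshold=0.5):
    """Apply the IoU > 0.5 acceptance rule."""
    return iou(pred, truth) > threshold


print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
print(location_accepted((0, 0, 2, 2), (0, 0, 2, 3)))
```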
mAP aka mean average precision
mAP $=\frac{\sum_{q=1}^QAveP(q)}{Q}$
recall $=\frac{TP}{TP+FN}$
precision $=\frac{TP}{TP+FP}$
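The metrics above can be sketched as follows. `average_precision` uses the non-interpolated definition (mean of the precision values at each rank where a correct result appears), which is one common choice and an assumption here; detection benchmarks often use interpolated variants instead. All function names are hypothetical.

```python
def precision(tp, fp):
    """TP / (TP + FP); defined as 0 when nothing was predicted positive."""
    return tp / (tp + fp) if tp + fp else 0.0


def recall(tp, fn):
    """TP / (TP + FN); defined as 0 when there are no positives."""
    return tp / (tp + fn) if tp + fn else 0.0


def average_precision(ranked_hits, num_relevant):
    """AveP for one query: ranked_hits[k] is True if the result at rank k+1 is correct."""
    hits = 0
    total = 0.0
    for k, hit in enumerate(ranked_hits, start=1):
        if hit:
            hits += 1
            total += hits / k       # precision at rank k
    return total / num_relevant if num_relevant else 0.0


def mean_average_precision(ave_ps):
    """mAP = sum_q AveP(q) / Q."""
    return sum(ave_ps) / len(ave_ps)


print(precision(8, 2), recall(8, 8))
print(average_precision([True, False, True], num_relevant=2))
print(mean_average_precision([1.0, 0.5]))
```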

Object detection

R-CNN, YOLO

sequence, recurrent, memory

LSTM
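One LSTM time step can be sketched with scalars to show how the gates control the memory cell. This assumes the standard formulation (forget, input, and output gates plus a tanh candidate); the scalar weights and the `params` layout are hypothetical simplifications, since real cells use weight matrices and vectors.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def lstm_step(x, h_prev, c_prev, params):
    """One scalar LSTM step; params maps gate name -> (w_x, w_h, b)."""
    def pre(gate):
        w_x, w_h, b = params[gate]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre("forget"))        # how much old memory to keep
    i = sigmoid(pre("input"))         # how much new candidate to write
    o = sigmoid(pre("output"))        # how much memory to expose
    g = math.tanh(pre("candidate"))   # candidate memory content
    c = f * c_prev + i * g            # updated cell state
    h = o * math.tanh(c)              # updated hidden state
    return h, c


# With all-zero parameters every gate is 0.5 and the candidate is 0.
zero_params = {name: (0.0, 0.0, 0.0)
               for name in ("forget", "input", "output", "candidate")}
print(lstm_step(1.0, 0.0, 2.0, zero_params))
```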

GRU

generator, discriminator

CGAN (Conditional GAN)

Computations are expressed as stateful dataflow graphs.

All data is modelled as tensors.

Operations are executed inside a Session.
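The "build the graph first, run it later" idea can be illustrated with a toy graph in pure Python. This is a hypothetical sketch, not the API of any real framework: the `Node`, `placeholder`, `constant`, `add`, `mul`, and `Session` names are invented here, values are plain floats rather than tensors, and stateful variables are left out.

```python
class Node:
    """One vertex of the dataflow graph; building it computes nothing."""
    def __init__(self, op, inputs=(), name=None, value=None):
        self.op = op
        self.inputs = inputs
        self.name = name
        self.value = value


def placeholder(name):
    return Node("placeholder", name=name)


def constant(value):
    return Node("const", value=value)


def add(a, b):
    return Node("add", (a, b))


def mul(a, b):
    return Node("mul", (a, b))


class Session:
    """Evaluates a node by recursively evaluating its inputs."""
    def run(self, node, feed=None):
        feed = feed or {}
        if node.op == "placeholder":
            return feed[node.name]
        if node.op == "const":
            return node.value
        args = [self.run(inp, feed) for inp in node.inputs]
        if node.op == "add":
            return args[0] + args[1]
        if node.op == "mul":
            return args[0] * args[1]
        raise ValueError("unknown op: " + node.op)


# Graph construction: y = 2*x + 1. No arithmetic happens yet.
x = placeholder("x")
y = add(mul(x, constant(2.0)), constant(1.0))

# Execution happens only inside the session, with x fed at run time.
print(Session().run(y, {"x": 3.0}))
```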

Queues allow asynchronous execution of the stateful dataflow graph.

Automatic differentiation
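A minimal sketch of reverse-mode automatic differentiation, the variant that back propagation corresponds to. It is a toy built on scalars with only addition and multiplication; the `Var` class and `backward` function are hypothetical names, and real frameworks operate on tensors with many more operations.

```python
class Var:
    """A scalar value that records how it was computed."""
    def __init__(self, value, parents=()):
        self.value = value
        self.grad = 0.0
        self.parents = parents  # tuple of (parent Var, local derivative)

    def __add__(self, other):
        return Var(self.value + other.value,
                   ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        return Var(self.value * other.value,
                   ((self, other.value), (other, self.value)))


def backward(output):
    """Propagate d(output)/d(node) to every node via the chain rule."""
    order, seen = [], set()

    def visit(v):
        # Post-order traversal: parents are appended before the node itself.
        if id(v) in seen:
            return
        seen.add(id(v))
        for parent, _ in v.parents:
            visit(parent)
        order.append(v)

    visit(output)
    output.grad = 1.0
    # Reverse order guarantees a node's gradient is complete before it is used.
    for v in reversed(order):
        for parent, local in v.parents:
            parent.grad += v.grad * local


x = Var(3.0)
y = Var(4.0)
f = x * y + x * x      # f = x*y + x^2
backward(f)
print(f.value, x.grad, y.grad)  # df/dx = y + 2x, df/dy = x
```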

Deep learning processor (DLP), aka deep learning accelerator

A DLP is an electronic circuit designed for deep learning algorithms, usually with separate data memory and a dedicated instruction set architecture.

Aim to optimize:

DLP Instruction Set

Other accelerators

Heterogeneous computing

How to develop a new operator?