Posted 2025-07-14 · Updated 2025-12-07 · Note · 5 minutes read (About 786 words)

Homework 5 Report

Zhichen Tang
124370910020

## Exercise 1

In this first part, I looked at the logistic model for population growth. The goal was to solve $u'(t) = C\,u(t)\,(1 - u(t)/B)$ with two numerical methods, Forward Euler and Heun. The parameters were set to $u_0=100$, $B=1000$, and $C=2/15$.

### a)

For Forward Euler, I take the current value and add the slope at that point multiplied by the time step to get the next value: $u_{n+1} = u_n + \Delta t\, f(t_n, u_n)$.

```matlab
% Completed feuler.m
function [t, u, dt] = feuler( f, I, u0, N )
    dt = (I(2)-I(1))/N;
    t = linspace(I(1),I(2),N+1);
    u = zeros(1,N+1);
    u(1) = u0;
    for n = 1:N
        u(n+1) = u(n) + dt * f(t(n), u(n));
    end
end
```


### b)
First, I calculated a temporary "predicted" value (just like a standard Euler step). Then, I averaged the slope at the current point and the slope at the next point (using the predicted value) to get a more accurate step.

```matlab
% Completed heun.m
function [t, u, dt] = heun( f, I, u0, N )
    dt = (I(2)-I(1))/N;
    t = linspace(I(1),I(2),N+1);
    u = zeros(1,N+1);
    u(1) = u0;
    for n = 1:N
        % Predictor: one Forward Euler step
        u_predictor = u(n) + dt * f(t(n), u(n));
        % Corrector: average the slopes at both ends of the step
        u(n+1) = u(n) + (dt/2) * ( f(t(n), u(n)) + f(t(n+1), u_predictor) );
    end
end
```


<div class="post-content"><img src="/2025/07/14/[OBS]课程-Math6003 Hw5/ex1b.png" alt="" title=""></div>
### c) 

After implementing the methods, I ran them with a coarse time step ($\Delta t=5$, corresponding to $N=20$) and plotted them against the exact solution.

It's pretty clear from the graph that the Heun method does a much better job. Its curve hugs the exact solution line much more tightly than the Forward Euler curve does. This makes sense because it's a higher-order method, so you'd expect it to be more accurate for the same step size.

<div class="post-content"><img src="/2025/07/14/[OBS]课程-Math6003 Hw5/ex1c.png" alt="" title=""></div>

### d)

Next, I repeated the comparison with a much smaller time step ($\Delta t=0.05$, or $N=2000$).

With the smaller time step, both methods improved dramatically. The plot shows that the curves for the Euler method, Heun's method, and the exact solution lie essentially on top of each other. This shows how decreasing the step size reduces the error of a numerical solution. Even so, the absolute error of Heun's method is still lower than that of the Forward Euler method.

<div class="post-content"><img src="/2025/07/14/[OBS]课程-Math6003 Hw5/ex1d.png" alt="" title=""></div>
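The comparison in parts c) and d) can also be checked numerically instead of by eye. The script below is a minimal sketch: it inlines the two update rules rather than calling `feuler.m` and `heun.m`, and it assumes the time interval is $[0, 100]$ (inferred from $\Delta t = 5$ with $N = 20$). The closed-form logistic solution is used as the reference.

```matlab
% Logistic model parameters from the exercise
u0 = 100; B = 1000; C = 2/15;
f = @(t, u) C .* u .* (1 - u ./ B);
% Closed-form solution of the logistic equation
uex = @(t) B*u0*exp(C*t) ./ (B + u0*(exp(C*t) - 1));

T = 100;                 % assumed final time (dt = 5 when N = 20)
Ns = [20 2000];
errE = zeros(size(Ns));  % max error, Forward Euler
errH = zeros(size(Ns));  % max error, Heun
for k = 1:numel(Ns)
    N = Ns(k); dt = T/N;
    t = linspace(0, T, N+1);
    uE = zeros(1, N+1); uE(1) = u0;
    uH = uE;
    for n = 1:N
        uE(n+1) = uE(n) + dt*f(t(n), uE(n));
        up = uH(n) + dt*f(t(n), uH(n));                      % predictor
        uH(n+1) = uH(n) + dt/2*(f(t(n), uH(n)) + f(t(n+1), up));
    end
    errE(k) = max(abs(uE - uex(t)));
    errH(k) = max(abs(uH - uex(t)));
end
fprintf('N = %4d: Euler %.3e, Heun %.3e\n', [Ns; errE; errH]);
```

If the methods behave as expected, the Euler error should shrink roughly in proportion to $\Delta t$ and the Heun error roughly in proportion to $\Delta t^2$.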

---

## Exercise 2

### a)

To solve this, I first had to convert the single 2nd-order equation into a system of two 1st-order equations. I defined a new variable, $V(t)=U'(t)$. This gives the first equation of the system.

Then, I rearranged the original RLC equation to isolate $U''(t)$:

$$ U''(t) = -\frac{R}{L}U'(t) - \frac{1}{LC}U(t) + \frac{f}{LC} $$

By substituting $V$ for $U'$ and $V'$ for $U''$, I got the second equation. This allowed me to write the whole system in matrix form, $X'=AX+b$:

$$\begin{pmatrix} V' \\ U' \end{pmatrix} = \begin{pmatrix} -R/L & -1/(LC) \\ 1 & 0 \end{pmatrix} \begin{pmatrix} V \\ U \end{pmatrix} + \begin{pmatrix} f/(LC) \\ 0 \end{pmatrix} $$

### b)

**Stability Analysis of Forward Euler**

The Forward Euler method is stable only if $|1 + \Delta t\,\lambda_i| \le 1$ for all eigenvalues $\lambda_i$ of the matrix $A$. Using the given component values ($L=0.01$, $C=10$, $R=0.1$), the matrix $A$ becomes:

$$ A = \begin{pmatrix} -10 & -10 \\ 1 & 0 \end{pmatrix} $$

Its eigenvalues are the roots of $\lambda^2 + 10\lambda + 10 = 0$, that is, $\lambda = -5 \pm \sqrt{15}$. Both are real and negative, so the condition reduces to $\Delta t \le 2/|\lambda_i|$, and the binding constraint comes from the eigenvalue with the largest magnitude. The method is therefore stable only if:

$$ \Delta t \le \frac{2}{5 + \sqrt{15}} \approx 0.2254 \text{ s} $$

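The critical time step can be reproduced in a few lines. This is a small sketch using the component values above; the general bound $\Delta t \le -2\,\mathrm{Re}(\lambda)/|\lambda|^2$ is used so the same lines would also cover complex eigenvalues, and for the real negative eigenvalues here it reduces to $2/|\lambda|$.

```matlab
% Component values from the exercise
L = 0.01; C = 10; R = 0.1;
A = [-R/L, -1/(L*C); 1, 0];

lambda = eig(A);
% |1 + dt*lambda| <= 1  <=>  dt <= -2*Re(lambda)/|lambda|^2
dt_crit = min(-2*real(lambda) ./ abs(lambda).^2);

fprintf('eigenvalues: %.4f, %.4f\n', lambda);
fprintf('critical dt: %.4f\n', dt_crit);
```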
### c)

**Simulating with Forward Euler**

I ran simulations for three different time steps. The results were a perfect illustration of the stability condition.

<div class="post-content"><img src="/2025/07/14/[OBS]课程-Math6003 Hw5/ex2c.png" alt="" title=""></div>

As you can see:

- With **N=43** ($\Delta t \approx 0.233$), the time step was too large, and the solution completely blew up.
- With **N=46** ($\Delta t \approx 0.217$), the time step was just inside the stable range, and the solution correctly showed a damped wave.
- With **N=500** ($\Delta t = 0.02$), the solution was also stable and much smoother.

### d)

**Simulating with Backward Euler**

Finally, I repeated the same simulations using the Backward Euler method.

<div class="post-content"><img src="/2025/07/14/[OBS]课程-Math6003 Hw5/ex2d.png" alt="" title=""></div>

The difference is night and day. The Backward Euler method was **stable for all three time steps**, even the large one that made the Forward Euler method fail. This shows that implicit methods like Backward Euler are much more robust for systems like this: because every eigenvalue of $A$ has a negative real part, the Backward Euler amplification factor $1/|1 - \Delta t\,\lambda_i|$ is below 1 for any $\Delta t > 0$, so there is no stability restriction on the time step. Accuracy still improves with smaller steps, but stability no longer depends on the step size.
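The stability behaviour of both methods can be sketched by looking at the spectral radius of each one-step matrix. The source term, initial state, and final time are not stated in this write-up, so the script assumes a constant source `fsrc = 1`, a zero initial state, and $T = 10$ (consistent with $\Delta t \approx 0.233$ for $N = 43$); these are illustrative choices only.

```matlab
L = 0.01; C = 10; R = 0.1;
A = [-R/L, -1/(L*C); 1, 0];
fsrc = 1;                      % assumed constant source term
b = [fsrc/(L*C); 0];
T = 10;                        % assumed final time
X0 = [0; 0];                   % assumed initial state [V; U]

Ns = [43 46 500];
rhoFE = zeros(size(Ns));       % spectral radius of I + dt*A
rhoBE = zeros(size(Ns));       % spectral radius of inv(I - dt*A)
UendFE = zeros(size(Ns)); UendBE = zeros(size(Ns));
for k = 1:numel(Ns)
    N = Ns(k); dt = T/N;
    Xf = X0; Xb = X0;
    for n = 1:N
        Xf = Xf + dt*(A*Xf + b);                 % Forward Euler step
        Xb = (eye(2) - dt*A) \ (Xb + dt*b);      % Backward Euler step
    end
    rhoFE(k) = max(abs(eig(eye(2) + dt*A)));
    rhoBE(k) = max(abs(1 ./ eig(eye(2) - dt*A)));
    UendFE(k) = Xf(2); UendBE(k) = Xb(2);
end
fprintf('N = %3d: rho_FE = %.4f, rho_BE = %.4f\n', [Ns; rhoFE; rhoBE]);
```

A spectral radius above 1 means perturbations are amplified at every step, which is what the N=43 Forward Euler run shows.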

Posted 2025-07-06 · Updated 2025-12-07 · Note · a few seconds read (About 54 words)

Use SSH to Connect TensorboardX

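Use ssh as the command-line remote tool to start TensorboardX on the remote machine and open it in a browser on the local machine.

Run on the remote machine:

```bash
tensorboard --logdir <path> --port 6006
```

Run on the local machine:

```bash
ssh -N -f -L localhost:16006:localhost:6006 bohan@10.11.16.146
```

With the tunnel running, local port 16006 is forwarded to port 6006 on the remote machine, so TensorBoard can be opened in the local browser at `http://localhost:16006`.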

Posted 2025-06-22 · Updated 2025-12-07 · Note · 7 minutes read (About 1067 words)

Math6003 Hw4

## Ex 1

### a

The result of `hw4_1_a.m`:

### b

The result of `hw4_1_b.m`:

The relative error $\epsilon$ in (b) is lower than in (a).

### c

Vandermonde matrices are known to be ill-conditioned, with the condition number growing exponentially with $n$.

### d

This tridiagonal matrix is well-conditioned. Its condition number grows polynomially with $n$ (like $O(n^2)$). The error $\epsilon_n$ should grow much more slowly than in the Vandermonde case, exhibiting polynomial rather than exponential growth. The residual $r_n$ will be a better, though not perfect, indicator of the true error compared to the previous case.

Comments on the obtained results:

- **Error Growth:** The relative error $\epsilon_n$ for the tridiagonal system grows polynomially with $n$, which is significantly slower than the exponential growth observed for the Vandermonde matrix. This will be visible as a straight line on the loglog plot.
- **Conditioning:** The dramatic difference in error behavior is due to the condition number. The Vandermonde matrix is severely ill-conditioned, whereas the tridiagonal matrix is well-conditioned.
- **Residual as an Error Indicator:** For the well-conditioned tridiagonal matrix, the normalized residual $r_n$ and the relative error $\epsilon_n$ have much closer values. The residual is a far more reliable indicator of the true error in this case than it was for the Vandermonde system.

## Ex 2

### a

The task is to complete the files `jacobi.m` and `gauss_seidel.m`. These files implement iterative methods based on a matrix splitting $A = P - (P-A)$, where $P$ is the preconditioner. The iterative update can be expressed as $x^{(k+1)} = x^{(k)} + z^{(k)}$, where $z^{(k)}$ is the solution to $Pz^{(k)} = r^{(k)}$, and $r^{(k)} = b - Ax^{(k)}$ is the residual.

For the **Jacobi method**, the preconditioner $P$ is the diagonal of $A$. For the **Gauss-Seidel method**, $P$ is the lower triangular part of $A$, diagonal included. The missing lines implement the calculation of the residual, the update step, and the storing of the residual norm at each iteration.

`jacobi.m`:

```matlab
function [x, iter, res]= jacobi(A,b,x0,nmax,tol)
% JACOBI iterative method
%   [X, Niter, Res] = JACOBI(A,B,X0,NMAX,TOL) attempts to solve the system of linear equations A*X=B
%   for X using the Jacobi method.

% Jacobi preconditioner
P=diag(diag(A));

% residual normalization
r0=norm(b);
if r0==0, r0=1; end

% first iteration
x=x0;
r=b-A*x; % The residual b - A*x
res(1)=norm(r); % The norm of the initial residual
iter=0;

while res(end)/r0 > tol && iter < nmax
    z=P\r; % Solve Pz=r
    x=x+z; % Update the solution
    r=b-A*x; % Recompute the residual
    iter=iter+1;
    res(iter+1)=norm(r); % Store the norm of the new residual
end

return
```

`gauss_seidel.m`:

```matlab
function [x, iter, res]= gauss_seidel(A,b,x0,nmax,tol)
% GAUSS-SEIDEL iterative method
%   [X, Niter, Res] = GAUSS_SEIDEL(A,B,X0,NMAX,TOL) attempts to solve the system of linear equations A*X=B
%   for X using the Gauss-Seidel method.

% Gauss-Seidel preconditioner
P=tril(A);

% residual normalization
r0=norm(b);
if r0==0, r0=1; end

% first iteration
x=x0;
r=b-A*x; % The residual b - A*x
res(1)=norm(r); % The norm of the initial residual
iter=0;

while res(end)/r0 > tol && iter < nmax
    z=P\r; % Solve Pz=r using forward substitution
    x=x+z; % Update the solution
    r=b-A*x; % Recompute the residual
    iter=iter+1;
    res(iter+1)=norm(r); % Store the norm of the new residual
end

return
```

### b

I solve the system for $n=4$ with the tridiagonal matrix $A$ having $2.5$ on the diagonal and $-1$ on the off-diagonals. The right-hand side is $b=(1.5,0.5,0.5,1.5)^{\top}$. The initial guess is $x^{(0)}=0$, tolerance is $10^{-10}$, and max iterations is $10^8$. I compare the number of iterations and computing time.

The matrix $A$ is strictly diagonally dominant since $|a_{ii}| = 2.5 > \sum_{j \neq i} |a_{ij}|$ for all rows. This property guarantees that both methods will converge.
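The $n=4$ experiment can be sketched without the two function files by looping over the two preconditioners. This is only an illustration of the same splitting iteration; the variable names (`P`, `iters`, `sol`) are chosen here and are not part of the assignment files.

```matlab
n = 4;
A = 2.5*eye(n) - diag(ones(n-1,1), 1) - diag(ones(n-1,1), -1);
b = [1.5; 0.5; 0.5; 1.5];
tol = 1e-10; nmax = 1e8;

P = {diag(diag(A)), tril(A)};          % Jacobi, Gauss-Seidel
names = {'Jacobi', 'Gauss-Seidel'};
iters = zeros(1, 2);
sol = zeros(n, 2);
for k = 1:2
    x = zeros(n, 1);                   % initial guess x0 = 0
    r = b - A*x;
    it = 0;
    while norm(r)/norm(b) > tol && it < nmax
        x = x + P{k} \ r;              % solve P z = r and update
        r = b - A*x;
        it = it + 1;
    end
    iters(k) = it;
    sol(:, k) = x;
    fprintf('%-12s: %d iterations\n', names{k}, it);
end
```

Both runs should converge to the vector of ones, since $A\,(1,1,1,1)^{\top} = b$ for this right-hand side.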

Gauss-Seidel converges faster.

### c

The analysis is repeated for the same matrix family with $n$ varying from 4 to 200. The exact solution is $x=(1,\dots,1)^{\top}$. I plot the number of iterations, the relative error $\|x-x_{c}\|/\|x\|$, and the normalized residual $\|b-Ax_{c}\|/\|b\|$ versus $n$. I also examine how the condition number of $A$ changes with $n$.

For this well-conditioned matrix (as shown in part e), the number of iterations grows very slowly with $n$. The condition number remains small and bounded, approaching 9 as $n$ grows. The relative error and the normalized residual are very close in value, indicating that the residual is a good estimator of the error.

### d

The analysis from (c) is repeated for the matrix $A$ with 2 on the diagonal and -1 on the off-diagonals, and $b=(1,0,\dots,0,1)^{\top}$. The exact solution is again $x=(1,\dots,1)^{\top}$. This matrix is only weakly diagonally dominant, so I expect slower convergence.

The number of iterations will grow much more rapidly with $n$. This is because, as shown in part (e), the condition number of this matrix grows quadratically with $n$, making the problem progressively harder to solve.

### e

I use the given formula for the eigenvalues of a symmetric tridiagonal matrix $M$ with $a$ on the diagonal and $b$ on the off-diagonals:

$$ \lambda_{i}(M)=a+2b\cos\left(\frac{\pi i}{n+1}\right), \quad i=1,\dots,n $$

The condition number is $\kappa_2(A) = |\lambda_{max}|/|\lambda_{min}|$.

**Case 1: Matrix from part (c)**

Here, $a=2.5$ and $b=-1$. The eigenvalues are $\lambda_i = 2.5 - 2\cos(\frac{\pi i}{n+1})$.

- As $n\to\infty$, $\lambda_{max} \to 2.5 - 2(-1) = 4.5$.
- As $n\to\infty$, $\lambda_{min} \to 2.5 - 2(1) = 0.5$.

The condition number $\kappa(A)$ approaches a constant, $\kappa(A) \to 4.5 / 0.5 = 9$. This is consistent with the numerical results from part (c), explaining the fast and stable convergence.

**Case 2: Matrix from part (d)**

Here, $a=2$ and $b=-1$. The eigenvalues are $\lambda_i = 2 - 2\cos(\frac{\pi i}{n+1}) = 4\sin^2(\frac{\pi i}{2(n+1)})$.

- As $n\to\infty$, $\lambda_{max} \to 4\sin^2(\pi/2) = 4$.
- For large $n$, $\lambda_{min} = 4\sin^2(\frac{\pi}{2(n+1)}) \approx 4(\frac{\pi}{2(n+1)})^2 = \frac{\pi^2}{(n+1)^2}$.

The condition number $\kappa(A) \approx \frac{4}{\pi^2/(n+1)^2} = \frac{4(n+1)^2}{\pi^2} = O(n^2)$. This quadratic growth in the condition number is consistent with the numerical results from part (d), explaining the significant increase in iterations with $n$.
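As a quick cross-check of the two cases, the eigenvalue formula can be compared with MATLAB's `cond`. The size `n = 50` below is an arbitrary illustrative choice, and `bo` stands for the off-diagonal value $b$ to avoid clashing with the right-hand side vector.

```matlab
n = 50;                         % illustrative size
idx = (1:n)';
as = [2.5, 2];                  % diagonal values for parts (c) and (d)
bo = -1;                        % off-diagonal value
kappa_formula = zeros(1, 2);
kappa_numeric = zeros(1, 2);
for k = 1:2
    a = as(k);
    lam = a + 2*bo*cos(pi*idx/(n+1));            % eigenvalue formula
    kappa_formula(k) = max(abs(lam))/min(abs(lam));
    A = a*eye(n) + bo*(diag(ones(n-1,1), 1) + diag(ones(n-1,1), -1));
    kappa_numeric(k) = cond(A);                  % 2-norm condition number
end
fprintf('a = %.1f: formula %.4f, cond() %.4f\n', [as; kappa_formula; kappa_numeric]);
fprintf('4(n+1)^2/pi^2 = %.4f\n', 4*(n+1)^2/pi^2);
```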

Posted 2025-06-16 · Updated 2025-12-07 · Note · 7 minutes read (About 975 words)

Math HW3

Exercise 1

Problem Analysis

The objective is to find the coefficients $(\alpha, \beta, \gamma)$ for the finite difference formula:
$$
D_{h}f(\overline{x})=\frac{\alpha f(\overline{x})+\beta f(\overline{x}-h)+\gamma f(\overline{x}-2h)}{h} \quad
$$
The analysis begins by substituting the Taylor series expansions for $f(\overline{x}-h)$ and $f(\overline{x}-2h)$ around the point $\overline{x}$ into the formula.

The expansions are:
$$
f(\overline{x}-h) = f(\overline{x}) - hf'(\overline{x}) + \frac{h^2}{2}f''(\overline{x}) - \frac{h^3}{6}f'''(\overline{x}) + O(h^4)
$$
$$
f(\overline{x}-2h) = f(\overline{x}) - 2hf'(\overline{x}) + 2h^2f''(\overline{x}) - \frac{4h^3}{3}f'''(\overline{x}) + O(h^4)
$$
Substituting these into the formula and grouping terms by derivatives of $f(\overline{x})$ results in:
$$
D_{h}f(\overline{x}) = \frac{\alpha + \beta + \gamma}{h} f(\overline{x}) + (-\beta - 2\gamma) f'(\overline{x}) + \left( \frac{\beta}{2} + 2\gamma \right)h f''(\overline{x}) + O(h^2)
$$
To approximate $f'(\overline{x})$, the coefficient of $f'(\overline{x})$ must be $1$, and the coefficient of $f(\overline{x})$ must be $0$. This yields two necessary equations:

1. $\alpha + \beta + \gamma = 0$
2. $-\beta - 2\gamma = 1$

a) Solution for First-Order Accuracy

For the formula to be of order 1, the two fundamental equations must be satisfied.

From equation (2), $\beta$ can be expressed in terms of $\gamma$:
$$
\beta = -1 - 2\gamma
$$
Substituting this into equation (1):
$$
\alpha + (-1 - 2\gamma) + \gamma = 0 \implies \alpha = 1 + \gamma
$$
Therefore, the family of coefficients $(\alpha, \beta, \gamma)$ that provides at least first-order accuracy is given by:
$$
(\alpha, \beta, \gamma) = (1+\gamma, -1-2\gamma, \gamma), \quad \forall \gamma \in \mathbb{R}
$$

b) Solution for Second-Order Accuracy

To achieve second-order accuracy, the truncation error must be $O(h^2)$. This requires the coefficient of the $h$ term in the error expansion to also be zero.

$$\frac{\beta}{2} + 2\gamma = 0$$

This creates a system of three linear equations to be solved for the unique values of $(\alpha, \beta, \gamma)$:

1. $\alpha + \beta + \gamma = 0$
2. $-\beta - 2\gamma = 1$
3. $\frac{\beta}{2} + 2\gamma = 0$

From equation (3), we find that $\beta = -4\gamma$. Substituting this into equation (2):
$$
-(-4\gamma) - 2\gamma = 1 \implies 2\gamma = 1 \implies \gamma = \frac{1}{2}
$$
With the value for $\gamma$, $\beta$ can be found:
$$
\beta = -4 \left( \frac{1}{2} \right) = -2
$$
Finally, substituting $\beta$ and $\gamma$ into equation (1) yields $\alpha$:
$$
\alpha + (-2) + \frac{1}{2} = 0 \implies \alpha = \frac{3}{2}
$$
The unique values which give a formula of order 2 are:
$$
\left(\alpha, \beta, \gamma\right) = \left(\frac{3}{2}, -2, \frac{1}{2}\right)
$$
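The orders derived in parts a) and b) can be checked numerically. The sketch below uses $f(x)=\sin(x)$ at $\overline{x}=1$ as an arbitrary smooth test case (not part of the exercise) and compares the $\gamma = 0$ member of the first-order family with the second-order coefficients.

```matlab
f  = @(x) sin(x);               % assumed test function
df = @(x) cos(x);               % its exact derivative
xb = 1;                         % evaluation point (arbitrary)
h  = 10.^(-(2:4));              % step sizes 1e-2, 1e-3, 1e-4

% D_h f = (alpha*f(x) + beta*f(x-h) + gamma*f(x-2h)) / h
D = @(c, h) (c(1)*f(xb) + c(2)*f(xb - h) + c(3)*f(xb - 2*h)) ./ h;

c1 = [1, -1, 0];                % gamma = 0: (1+gamma, -1-2gamma, gamma)
c2 = [3/2, -2, 1/2];            % second-order coefficients from part b)

e1 = abs(D(c1, h) - df(xb));
e2 = abs(D(c2, h) - df(xb));

% Slope of log(error) against log(h) estimates the order
p1 = polyfit(log(h), log(e1), 1);
p2 = polyfit(log(h), log(e2), 1);
fprintf('observed orders: %.2f and %.2f\n', p1(1), p2(1));
```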

Exercise 2

a) Midpoint Formula Calculation

The first task is to compute the integral of the function $f(x)=e^{-x}\sin(x)$ on the interval $[a,b]=[0,2]$. The computation uses the composite midpoint formula with $M=10$ sub-intervals.

The resulting numerical approximation of the integral is 0.468592. The exact value is 0.466630, so the absolute error is 0.001962.

b) Convergence of the Midpoint Formula

This section analyzes the convergence of the composite midpoint formula for the same function and interval. The number of sub-intervals is varied as $M=10^{1},10^{2},10^{3},\dots,10^{5}$.

The absolute error, $|I(f)-Q_{h}^{pm}(f)|$, is computed for each corresponding step size, $h=(b-a)/M$. This error is the difference between the numerical result and the exact value of the integral, $I=(1-e^{-2}(\sin(2)+\cos(2)))/2$.

A graph of the error versus the step size $h$ is then created using a logarithmic scale for both axes. In such a plot, the slope of the resulting line corresponds to the order of convergence.

**Plot Analysis**

The plot displays a straight line: the error decreases steadily as $h$ decreases. A linear fit to the logarithmic data reveals a slope of approximately 2. This confirms that the method exhibits 2nd-order convergence, which aligns with the theoretical error bounds for the composite midpoint rule.
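A compact version of this convergence study might look as follows. It is a sketch rather than the submitted script: the composite midpoint rule is written inline, and the order is estimated with a least-squares line through the log-log data.

```matlab
f = @(x) exp(-x).*sin(x);
a = 0; b = 2;
Iex = (1 - exp(-2)*(sin(2) + cos(2)))/2;    % exact integral

Ms = 10.^(1:5);
hs = (b - a)./Ms;
err = zeros(size(Ms));
for k = 1:numel(Ms)
    M = Ms(k); h = hs(k);
    xm = a + h*((1:M) - 0.5);               % midpoints of the sub-intervals
    Q = h*sum(f(xm));                       % composite midpoint rule
    err(k) = abs(Iex - Q);
end

p = polyfit(log(hs), log(err), 1);          % p(1) estimates the order
fprintf('error for M = 10: %.6f\n', err(1));
fprintf('observed order:   %.3f\n', p(1));
% loglog(hs, err, 'o-'); xlabel('h'); ylabel('error');   % optional plot
```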

c) Convergence of Trapezoidal and Simpson’s Formulas

The analysis is repeated using the composite trapezoidal formula and the composite Simpson’s formula. The errors for both methods, $|I(f)-Q_{h}^{trap}(f)|$ and $|I(f)-Q_{h}^{simp}(f)|$, are plotted on the same graph in logarithmic scale.

**Plot Analysis and Comparison**

The resulting graph contains two distinct straight lines, one for each method.

* **Trapezoidal Rule:** The error line for the trapezoidal rule has a slope of approximately 2. This confirms its theoretical 2nd-order convergence.
* **Simpson's Rule:** The error line for Simpson's rule is significantly steeper, with a slope of approximately 4. This demonstrates its theoretical 4th-order convergence.

A comparison of the results shows that for any given step size $h$, the error from Simpson’s rule is substantially smaller than the error from the trapezoidal rule, highlighting the superior accuracy of the higher-order method for smooth functions.
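The two slopes can be estimated with a short script. Two assumptions are made here: Simpson's formula is applied on each sub-interval using its midpoint as the third node, and the order is fitted over $M = 10, 20, 40, 80$ so that the Simpson error stays well above rounding level.

```matlab
f = @(x) exp(-x).*sin(x);
a = 0; b = 2;
Iex = (1 - exp(-2)*(sin(2) + cos(2)))/2;    % exact integral

Ms = [10 20 40 80];
hs = (b - a)./Ms;
errT = zeros(size(Ms)); errS = zeros(size(Ms));
for k = 1:numel(Ms)
    M = Ms(k); h = hs(k);
    x  = a + h*(0:M);                       % sub-interval end points
    xm = a + h*((1:M) - 0.5);               % sub-interval midpoints
    Qt = h*(sum(f(x)) - (f(a) + f(b))/2);   % composite trapezoidal rule
    Qs = h/6*(f(a) + f(b) + 2*sum(f(x(2:end-1))) + 4*sum(f(xm)));  % composite Simpson
    errT(k) = abs(Iex - Qt);
    errS(k) = abs(Iex - Qs);
end
pT = polyfit(log(hs), log(errT), 1);
pS = polyfit(log(hs), log(errS), 1);
fprintf('observed orders: trapezoidal %.2f, Simpson %.2f\n', pT(1), pS(1));
```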

d) Analysis for a Non-Smooth Function

Finally, the convergence analysis is repeated for all three methods on a new function, $f(x)=\sqrt{|x|^{3}}$, over the interval $[a,b]=[-2,2]$. The exact value for this integral is $I=\frac{16}{5}\sqrt{2}$.

Comments on the Obtained Results

The function $f(x)=\sqrt{|x|^{3}}$ is not sufficiently smooth at the point $x=0$ within the integration interval. While the function and its first derivative are continuous, its second derivative, $f''(x) \propto |x|^{-1/2}$, is unbounded at $x=0$.

The theoretical error estimates that predict 2nd and 4th-order convergence for these methods are based on the assumption that the function’s higher-order derivatives are continuous and bounded. Since this condition is violated, a degradation in the observed convergence rates is expected.

**Plot Analysis**

The log-log plot for this non-smooth function reveals the following:

* The convergence rates for all three methods are significantly reduced.
* The slopes of the error lines for the midpoint, trapezoidal, and Simpson's rules are all approximately **1.5**.
* The high-order accuracy advantage of Simpson's rule is lost; its performance becomes comparable to the other two lower-order methods.

This result demonstrates that the practical performance and convergence rate of a numerical integration method are fundamentally limited by the smoothness of the function being integrated.

Posted 2025-06-11 · Updated 2025-12-07 · Note · a few seconds read (About 26 words)

Data Mining Exam Outline (Chinese)

01:

02:

03:

04:

05:

06:

07:

10:

12:

13:

14:

15:

16:

Posted 2025-06-11 · Updated 2025-12-07 · Note · a few seconds read (About 13 words)

Data Mining Exam Outline

01:

02:

03:

04:

05:

06:

07:

10:

12:

13:

14:

15:

16:

Posted 2025-06-09 · Updated 2025-12-07 · Food Comes First · a few seconds read (About 31 words)

!!As Seen on Friends!! Cheesecake 🧀

Helping hands (6/14):

- nyc
- cyl
- tzc
- 寒州州州州老师
- lxg

Posted 2025-06-05 · Updated 2025-12-07 · Note · 10 minutes read (About 1569 words)

Homework 2: Curve Fitting

Exercise 1

(a) Polynomial Interpolation

To approximate the function $f(x) = \frac{1}{1 + \exp(4x)}$ on the interval $[-5, 5]$, polynomial interpolation was performed using polynomials of degree $n=6$ and $n=14$. Two types of nodes were considered: equally spaced nodes and Clenshaw-Curtis nodes, the latter computed using the formula

$$
x_i = \frac{a + b}{2} - \frac{b - a}{2} \cos\left(\pi \frac{i}{n}\right), \quad i = 0, \dots, n.
$$

Using polyfit, the interpolation polynomials were computed based on the function values at the chosen nodes. The polynomials were evaluated using polyval on a fine grid over [-5, 5] and plotted alongside the true function f(x). The results showed that interpolation with Clenshaw-Curtis nodes provided better approximation near the boundaries, especially for n=14, reducing the Runge phenomenon that is prominent with equally spaced nodes.

(b) Piecewise Linear Interpolation

For the same values of $n=6$ and $n=14$, piecewise linear interpolation was carried out using `interp1` with equally spaced nodes. The linear interpolant $p_{1,h}$ was constructed over $n$ subintervals of length $h = \frac{10}{n}$. The resulting piecewise linear functions were evaluated on a fine grid and compared graphically with the original function $f(x)$. While less smooth than the high-degree polynomials, the linear interpolation captured the overall trend of the function and showed more stable behavior near the interval edges.

(c) Error Analysis

To quantitatively assess the interpolation accuracy, the maximum approximation error

$$
E_n = \max_{x \in [-5, 5]} |f(x) - p_n(x)|
$$

was computed for $n=2$ to $40$ for both equally spaced and Clenshaw-Curtis nodes. Similarly, the maximum error for piecewise linear interpolation, $E_{1,h}$, was evaluated for the same range of $n$. A loop was used to iterate over $n$, computing and storing the errors. The results were visualized on a log-scale plot with respect to $n$. The plots clearly demonstrated that Clenshaw-Curtis interpolation achieves significantly better accuracy with increasing $n$, while equally spaced nodes lead to instability and large errors as $n$ grows. Piecewise linear interpolation, although low-order, exhibited stable and predictable convergence.
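For a single degree, the three errors can be computed as in the sketch below. It fixes $n = 14$ and uses a 2001-point evaluation grid, both illustrative choices; `polyfit` may warn about conditioning at this degree, which does not change the qualitative picture.

```matlab
f = @(x) 1 ./ (1 + exp(4*x));
a = -5; b = 5;
n = 14;                                           % polynomial degree

xe = linspace(a, b, n+1);                         % equally spaced nodes
xc = (a + b)/2 - (b - a)/2 * cos(pi*(0:n)/n);     % Clenshaw-Curtis nodes
xx = linspace(a, b, 2001);                        % fine evaluation grid

pe = polyfit(xe, f(xe), n);                       % interpolant, equispaced
pc = polyfit(xc, f(xc), n);                       % interpolant, Clenshaw-Curtis
pl = interp1(xe, f(xe), xx);                      % piecewise linear interpolant

errE = max(abs(f(xx) - polyval(pe, xx)));
errC = max(abs(f(xx) - polyval(pc, xx)));
errL = max(abs(f(xx) - pl));
fprintf('n = %d: equispaced %.3e, Clenshaw-Curtis %.3e, linear %.3e\n', ...
        n, errE, errC, errL);
```

Wrapping these lines in a loop over $n = 2, \dots, 40$ and storing the three errors gives the data for the error plot described above.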

Exercise 2

(a) Equally Distributed Nodes

The function $f(x) = \sin(x) + x$ was interpolated on the interval [0,10] using Lagrange polynomial interpolation. Two degrees were considered: $n = 4$ and $n = 15$, corresponding to $n + 1$ equally spaced nodes. For each value of n, two datasets were created: one using the exact function values $y_i = f(x_i)$, and the other using perturbed values $z_i = y_i + \epsilon_i$, where $\epsilon_i \sim \text{Uniform}(-0.1, 0.1)$. Using polyfit and polyval, the interpolation polynomials were computed and evaluated on a dense grid for visualization.

The results show that for n = 4, both the unperturbed and perturbed interpolants approximate the true function reasonably well, though the perturbed one deviates slightly. For n = 15, the interpolant using exact values exhibits oscillatory behavior near the boundaries, a clear manifestation of the Runge phenomenon. The polynomial interpolant using noisy data becomes highly unstable, demonstrating extreme sensitivity to small perturbations in the input values.

(b) Clenshaw-Curtis Nodes

The same procedure was repeated using Clenshaw-Curtis nodes computed using the cosine-based formula. For both n = 4 and n = 15, interpolation polynomials were fitted using the exact and perturbed data values. When visualized, the interpolants using Clenshaw-Curtis nodes performed significantly better near the boundaries compared to those using equally spaced nodes. Particularly for n = 15, the polynomial interpolant with exact values closely follows the true function, with reduced oscillations. Even when perturbations are introduced, the polynomial remains relatively stable and does not exhibit the extreme sensitivity observed in the equally spaced case. This confirms that Clenshaw-Curtis nodes enhance numerical stability in high-degree interpolation.

(c) Piecewise Linear Interpolation

Piecewise linear interpolation was implemented using interp1 on equally spaced nodes for both n = 4 and n = 15. As before, both the exact and perturbed datasets were used to construct the interpolants. The linear interpolants followed the general shape of the original function and remained visually close to the true curve even when noise was added. This behavior was consistent across both values of n. While less accurate than high-degree polynomial interpolation in smooth regions, the piecewise linear approach showed strong robustness to noise and did not exhibit oscillations or instability, making it a reliable method for interpolating noisy data.

Exercise 3

(a) Least Squares Polynomial of Degree 4

The file `data1.mat` was loaded to obtain the dataset $(x_i, y_i)$, where $y_i = \sin(x_i) + x_i + \epsilon_i$, and $\epsilon_i$ represents Gaussian noise with zero mean and standard deviation $\sigma=0.1$. A degree-4 least squares polynomial $p^{LS}_4$ was fitted using `polyfit(x, y, 4)`. On the same plot, the original function $f(x) = \sin(x) + x$, the noisy data points $(x_i, y_i)$, and the polynomial $p^{LS}_4(x)$ were displayed. The polynomial provided a smooth approximation that followed the general trend of the function, mitigating the effect of noise.

(b) Higher-Degree Least Squares Polynomials (m = 7, 15)

The procedure was repeated for polynomial degrees $m = 7$ and $m = 15$. For $m = 7$, the approximation improved slightly, with the polynomial capturing more curvature of the underlying function. However, at $m = 15$, overfitting became evident—the polynomial began to follow the noise, especially near the edges of the interval. This illustrates the typical variance-bias tradeoff in polynomial regression: higher degrees reduce bias but increase sensitivity to noise.

(c) Approximation Error vs. Degree

To assess the accuracy of the least squares fit, the error

$$
E(p^{LS}_m, f) = \max_{x \in [0, 10]} |f(x) - p^{LS}_m(x)|
$$

was computed for degrees $m = 1$ to $15$, evaluated over a fine grid. The error was then plotted as a function of $m$ on a semi-logarithmic scale. The plot showed a decreasing trend in the error for small $m$, reaching a minimum at some intermediate degree. Beyond that, the error increased, reflecting overfitting. This confirms that, for noisy data, the optimal degree is not necessarily high.

(d) Noise Variance Estimation and Error Comparison

The variance of the noise was estimated using the formula

$\hat{\sigma}^2 = \frac{1}{n - m} \sum_{i=1}^{n+1} \left( y_i - p^{LS}_m(x_i) \right)^2,$

with $m = 4$ and $n + 1 = 20$. The resulting estimate $\hat{\sigma}$ (the square root of the estimated variance) was then used to compute the expected approximation error scale

$\hat{\sigma} \cdot \sqrt{\frac{m + 1}{n + 1}}.$

The previously observed minimal error matched this estimate in magnitude, validating the analytical approximation. A comparison with the theoretical value $\sigma \cdot \sqrt{(m+1)/(n+1)}$ was also performed, showing good agreement.
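The estimate can be sketched on synthetic data. Since `data1.mat` is not reproduced here, the script below generates its own 20 noisy samples on an equally spaced grid over $[0, 10]$, which is an assumption about how the data were sampled; the numbers it prints will therefore differ from the ones obtained with the course data.

```matlab
m = 4;                        % polynomial degree
npts = 20;                    % number of samples, n + 1
sigma = 0.1;                  % noise standard deviation

f = @(x) sin(x) + x;
x = linspace(0, 10, npts)';   % assumed sampling points
y = f(x) + sigma*randn(npts, 1);

p = polyfit(x, y, m);                        % least squares fit
res = y - polyval(p, x);                     % residuals at the data points
sigma2_hat = sum(res.^2)/(npts - 1 - m);     % 1/(n - m) * sum of squares
sigma_hat = sqrt(sigma2_hat);

scale_est    = sigma_hat*sqrt((m + 1)/npts); % estimated error scale
scale_theory = sigma*sqrt((m + 1)/npts);     % same scale with the true sigma

xx = linspace(0, 10, 1001)';
Emax = max(abs(f(xx) - polyval(p, xx)));     % actual maximum error
fprintf('sigma_hat^2 = %.5f, scale (est.) = %.5f, scale (theory) = %.5f, Emax = %.5f\n', ...
        sigma2_hat, scale_est, scale_theory, Emax);
```

Note that $\hat{\sigma}^2$ absorbs any systematic misfit of the degree-$m$ polynomial as well as the noise, so it can exceed $\sigma^2$ when the polynomial cannot follow $f$ closely.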

(e) Large-Scale Data: n + 1 = 20000

The analysis from parts (a) to (d) was repeated using the dataset in `data2.mat`, which contains $n + 1 = 20000$ samples. The results demonstrated a dramatic improvement in the stability and accuracy of least squares fitting. Overfitting effects were less pronounced even at higher polynomial degrees due to the large number of samples. The error curve in (c) showed a more gradual and consistent decrease, and the noise influence was better averaged out. The estimated variance was $\hat{\sigma}^2 \approx 0.04814$, and the corresponding error scale $\hat{\sigma} \cdot \sqrt{(m+1)/(n+1)}$ was approximately 0.00347.

(f) Error Comparison Between Small and Large Datasets

The minimal approximation error achieved using the larger dataset was significantly smaller than that obtained from the 20-point dataset. This reflects the benefit of larger sample sizes in reducing noise impact and improving model robustness. Moreover, the estimated noise error scale $\hat{\sigma} \cdot \sqrt{(m+1)/(n+1)}$ decreased accordingly due to the much larger denominator, reinforcing the statistical intuition.

Exercise 4

The Lagrange interpolation error at any point $x$ in the interval containing the nodes is given by

$$
f(x) - p_n(x) = \frac{f^{(n+1)}(\xi)}{(n+1)!} \prod_{k=0}^n (x - x_k)
$$

for some $\xi$ in that interval. Taking the absolute value yields the error bound

$$
|f(x) - p_n(x)| \leq \frac{M}{(n+1)!} \left| \prod_{k=0}^n (x - x_k) \right|
$$

where $p_n(x)$ is the interpolating polynomial of degree $n$ through $n+1$ points and $M$ is an upper bound on $|f^{(n+1)}|$ over the interval. For both functions, the interpolation degree was set to $n=4$, which requires the fifth derivative of the target function.

For the function $f_1(x) = \cosh(x)$, the interpolation nodes were defined as $x_k = -1 + \frac{k}{2}$ for $k = 0, \dots, 4$, giving equally spaced points in the interval $[-1,1]$. The fifth derivative of $\cosh(x)$ is $\sinh(x)$, since derivatives of hyperbolic functions follow a repeating pattern. The maximum of $|\sinh(x)|$ over $[-1,1]$ occurs at $x = \pm 1$, where $\sinh(1) \approx 1.175$. Substituting into the error formula, the upper bound is this derivative value times the absolute value of the product of linear terms $(x - x_k)$, divided by $5! = 120$.

For the function $f_2(x) = \cos(x) + \sin(x)$, the interpolation nodes were chosen as $x_k = -\frac{\pi}{2} + \frac{\pi k}{4}$ for $k = 0, \dots, 4$, covering the interval $[-\pi/2, \pi/2]$. The fifth derivative of $f_2$ is $\cos(x) - \sin(x)$, based on the periodic differentiation cycle of sine and cosine. The maximum of the absolute value of this expression was found numerically over the interpolation interval. This value was then used in the same error formula as above, with the factorial and the product of distances from the interpolation nodes.

In both cases, the error bound reflects how the function’s higher-order smoothness and the choice of interpolation nodes influence the potential deviation between the true function and its polynomial approximation. The approach confirms that even for smooth functions, the interpolation error can grow significantly away from the nodes if the derivatives are large or if the node distribution leads to large oscillations in the interpolating polynomial.
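For $f_1(x) = \cosh(x)$ the bound can be evaluated and compared with the actual interpolation error. The sketch below takes the maximum of the node product over a 2001-point grid on $[-1, 1]$, which is an illustrative way to turn the pointwise bound into a single number.

```matlab
n = 4;
xk = -1 + (0:n)/2;                      % nodes -1, -0.5, 0, 0.5, 1
f = @(x) cosh(x);
xx = linspace(-1, 1, 2001);             % evaluation grid

% Node polynomial omega(x) = prod_k (x - x_k)
omega = ones(size(xx));
for k = 1:n+1
    omega = omega .* (xx - xk(k));
end

Mder = sinh(1);                         % max of |f^(5)| = |sinh| on [-1, 1]
bound = Mder/factorial(n+1) * max(abs(omega));

p = polyfit(xk, f(xk), n);              % degree-4 interpolating polynomial
err = max(abs(f(xx) - polyval(p, xx))); % actual maximum error on the grid
fprintf('actual max error %.3e <= bound %.3e\n', err, bound);
```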

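Posted 2025-05-24 · Updated 2025-12-07 · Reading · an hour read (About 12995 words)

Zen and the Art of Motorcycle Maintenance: Reading Group p10

Notes taken while reading

The Romance of the Motorcycle

Traveling by motorcycle is completely different from any other way of traveling. In a car you are confined to a small compartment, and because you are used to it you do not realize that looking at the scenery through the car window is much like watching television. You are only a passive viewer, and the scenery just slides rigidly past the window.
On a motorcycle it is different. There is no window glass blocking your view, and you feel closely bound up with nature. You are in the scene rather than watching it, and the sense of being there is overwhelming. What rushes by beneath your feet is real concrete road, no different from the ground you walk on. It lies there solidly; it blurs because of the speed, but you can stop at any time, feel its presence right away, and let that solidity sink deep into your mind.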

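Traveling as the Heart Leads

We deliberately avoid following a fixed itinerary and would rather stop and go as we please, because traveling itself is far more pleasant than hurrying to some destination. We are on vacation now and want to take the back roads; gravel country roads are the best choice of all. State highways come next, and freeways are the last resort. We intend to enjoy the scenery along the way, so we want to savor the journey itself and not do anything as mood-killing as touring several sights in a very short time.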

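The Depth and Breadth of Thought

I have no intention of digging any new channels in my mind; I only want to dredge the old ideas, because the channel has become clogged with rotten, stinking thoughts and stale notions. "What's new?" is the question people find most interesting, but it is also the most aimless, and it can be asked endlessly. If you seriously pursue its answer, all you get is a pile of trivial, faddish things, which are tomorrow's silt. I would rather ask: "What is best?" This question dredges the channel rather than widening it. There have been eras in human history when the channels of thought were cut so deep that they could not be changed and nothing new could ever appear, and then the pursuit of "the best" became rigid dogma. But that is not our situation now. Today's general thinking seems to have long since overflowed both banks, lost its main goal and direction, flooded the lowlands, isolated the highlands, and cut them off from other regions. Apart from the wasteful restlessness of the water itself, spilling everywhere like this serves no purpose, so it seems the time really has come for some dredging.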

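The Faucet

One day I was at their house waiting to set off together, and I noticed the faucet was dripping. I remembered it had been dripping the last time, and in fact it had been dripping for a long time. I mentioned it, and John told me he had put in a new washer but it still dripped. That was all he said and he did not bring it up again; in other words, the matter ended there. If you have tried to fix a faucet and nothing changed, that means you are simply fated to have a dripping faucet.
I was amazed. With the faucet dripping day after day, year after year, weren't they going to have a nervous breakdown? But I found they did not worry about it at all and paid it no attention. So my conclusion was that they were not bothered by the faucet. Some people really are like that. I don't remember what changed this judgment... It seems Sylvia was about to speak while the dripping was especially loud, and it unintentionally changed her mood. Her voice is always soft, and one day she was trying to talk above the dripping when the children came in and interrupted her, and she lost her temper, as if the dripping had caused it. In fact both things caused it, and what surprised me was that she did not blame the faucet; she even deliberately avoided blaming it. She had actually noticed the faucet problem long ago and was just deliberately suppressing her anger, and that damned faucet was nearly driving her mad! But she seemed to have some hidden reason for not admitting how serious the problem was.
I wondered: why suppress your anger at a faucet?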

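Work

The most important clue seemed to be the expressions on their faces. Yet it is really hard to explain: they looked easygoing, friendly, and relaxed, but they were not involved in the work. They were like spectators; you felt they were just hanging around there and then taking the wrench someone handed them.
They had no sense of identification with their work and would never say, "I am a mechanic." Once five in the afternoon came and the eight hours were up, you knew they would put down their work at once, leave immediately, and then think about their work as little as possible.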

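Hurry

I don't want to rush things, because hurry itself is the most objectionable attitude of the twentieth century. When you do something and want to get it done quickly, it means you no longer care about it and want to go do something else.

This idea is extremely handy for understanding why I feel anxious. For example, if I am in a hurry to finish this book, it is probably because I really am in a hurry to get to the lab and work, or perhaps because I hope to finish it before the weekend so I can take it home for my dad to read. If it is the former, then I should get the necessary work done first and come back to the book afterwards, not because the lab work matters more, but because mechanical work (at least the industry-sponsored project I am currently responsible for) really can be finished quickly with a detached, spectator-like, hurried attitude, whereas reading this book obviously cannot.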

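Ghosts

"It is quite natural to think that Europeans or Indians who believed in ghosts were ignorant. From the scientific point of view, such people are still in a very primitive state. So today, anyone who says he believes in ghosts and spirits is regarded by others as ignorant, or even as not right in the head, because it is hard to imagine what a world in which ghosts exist would be like."

John nodded in agreement, and I went on.

"My own view is that modern people are not necessarily smarter than people in the past. Human IQ has not changed much. Those Indians and medieval people were much like us, but they lived in a different environment. In their environment they believed ghosts and spirits existed, just as modern people believe atoms, protons, photons, and quanta exist. In that sense I believe in ghosts; that is to say, modern people have their own ghosts and spirits too, you know."

"What do you mean?"

"For example, the laws of physics, logic... the number system... geometry and algebra, and so on. These are all ghosts. We just believe in them so thoroughly that they seem real."

John said, "I think they are real."

Chris said, "I don't get it!"

So I went on: "For example, it seems perfectly natural to assume that gravity existed before Newton discovered it, and it would be foolish to think that gravity did not exist until the seventeenth century."

"Of course."

"So when did this law start to exist? Has it always existed?"

John frowned, not knowing what I was getting at.

I said, "What I mean is that before the earth existed, before the sun, moon, and stars were formed, before the beginning of everything, gravity already existed."

"Of course."

"Gravity had no mass of its own and no energy of its own. People had not yet appeared, so it did not exist in anyone's mind either. It was not in space, because there was no space, and it was not anywhere at all. Did this gravity still exist?"

Now John was not so sure.

I said, "If gravity existed, then honestly I don't know what nonexistence would be. It seems to me gravity has passed every test of nonexistence. You cannot think of a single condition of nonexistence that gravity fails to meet, or a single piece of scientific evidence of its existence. And yet people in general still believe it existed."

John said, "I'll have to think about that."

"I suspect that if you keep thinking about it, you will just go round and round in circles until you arrive at the only reasonable and meaningful conclusion, which is that gravity did not exist before Newton was born. There is no other reasonable conclusion.

"What I mean," I went on before he could interrupt, "is that the law of gravity exists only in people's minds, and that is a kind of ghost too! We are quick to attack other people's ghosts, ignorantly and arrogantly, yet toward the ghosts in our own minds we are just as ignorant and blindly believing."

"Then why does everyone believe that gravity really exists?"

"Everyone has been hypnotized. The more orthodox way of putting it is that everyone has been educated."

"You mean teachers hypnotize their students into believing gravity exists?"

"Exactly."

"That sounds absurd."

"Have you heard of the importance of eye contact in the classroom? Every educator stresses it, but no one will explain it to you."

I said, "We believe that Newton's theory had already been present in the chaos of the universe for billions of years before he was born, and that he miraculously discovered it. It was there all along, although it was not put to use. Later the theory gradually took shape and people applied it. In fact these theories formed the world. John, that notion is absurd.

"The contradiction that scientists face is the mind. Mind is neither matter nor energy, yet they cannot deny that mind is present in everything they do. Logic exists in the mind, and numbers exist only in the mind. If scientists say that ghosts exist only in people's minds, I will not object. The key word there is 'only': it is not wrong to say that science exists only in your mind, and the same goes for ghosts."

They were still looking at me, so I went on: "The laws of nature are human inventions, just like ghosts. Logic and mathematics are too, and everything worth praising is also a human invention. This world is something human beings have imagined, and taken as a whole it is a kind of spirit world. In ancient times the wonderful world we live in was seen exactly that way. It is ruled by ghosts and spirits, and we can see this world only because they let us see it. They are Moses, Jesus Christ, the Buddha, Plato, Rousseau, Jefferson, Lincoln, and so on. Newton is a very good one, perhaps the best of them. So our common sense is made up of hundreds and thousands of ghosts from the past, all trying to find their place in the lives of the living."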

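Gratitude toward modern rational society?

It is cold, but not that cold. How do John and Sylvia get through Minnesota winters, I wonder. Here there is an obvious contradiction: if they cannot stand physical discomfort and at the same time cannot accept the fruits of technology, they are going to have to compromise somewhere. They depend on technology and curse it at the same time. I am sure they know this, and that it is exactly why they are unhappy with the whole situation. They are not putting forward a reasoned argument; they are just reacting directly.
Now three farmers are coming into town, showing off the trucks and washing machines they have just got hold of. They treasure technology and yet are the people who need it least. Without it they might have a hard time, but they would get by fine. John, Sylvia, Chris and I might be dead within a week. Cursing technology like this is ungrateful, but that is how things are.
Another dead end. If someone lacks gratitude and you tell him so to his face, you have in effect insulted him, and you have solved nothing.

There really are far too many examples of this in everyday life.

I see people like John and Sylvia living blindly and in alienation within the rational structure of civilization as a whole. They look for answers outside that structure, but they cannot find any that are lasting and satisfying.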

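The beer can

I said, “You should shim it with a thin strip of metal.”

“What thin strip of metal?”

“Just a thin, flat strip of metal. You slip it into the gap at the handlebars and they hold tighter. Shims like this are used in repairing all kinds of machines.”

“Oh,” he said, getting a little interested. “Good. Where do you buy them?”

“I've got some right here,” I said happily, holding up a beer can.

It took him a moment to catch on. Then he said, “What? The beer can?”

“Sure,” I said. “Best shim stock in the world.” I thought this was pretty clever: it saved him hunting around for somewhere to buy shims, and it saved him time and money.

But to my surprise, he did not see the cleverness of it at all. In fact he got quite haughty about the whole thing and came up with all sorts of excuses to put me off, and only later did I work out his real attitude. In the end we decided not to fix the handlebars.

As far as I know, the handlebars are still loose. I do know that he was really angry at the time: I had dared to propose fixing his brand-new eighteen-hundred-dollar BMW with a strip of beer can! That bike stood for half a century of German mechanical finesse. After that we seldom brought up motorcycle maintenance; thinking back, I don't believe we ever talked about it again.

I should have explained it to him this way: the beer can is aluminum, which is soft and grips well, so it is ideal for this application. It also does not oxidize in damp conditions; or, to be more precise, its surface carries a thin layer of oxide that prevents further oxidation.

In other words, any truly first-rate German mechanic, with all his mechanical finesse, would conclude that this solution could hardly be bettered.

Later I thought that what I should have done was sneak over to the workbench, cut a piece from the beer can, remove the printing, and come back and tell him we were in luck: there was just one piece left, imported from Germany. That would have done it. It was made by the German firm of Baron Alfred Krupp, and I had got it at a special price. Then he would never have figured out what was going on.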

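The motorcycle as ideas

A precision instrument is designed to express an idea. Perfection in physical dimensions is impossible: no part of a motorcycle can ever be perfect. But when you come close to perfection, something astonishing happens, because within those tolerances the machine flies wonderfully across the countryside. So the essential thing is to understand the idea. When John looks at a motorcycle he sees only an assortment of structures, so he takes a dislike to it and refuses any further contact. In my eyes, though, I see the designer's ideas. John thinks I am handling parts; what I am actually handling is concepts.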

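Repairing the motorcycle

An untrained observer sees only the physical labor the mechanic puts in and assumes that labor is the main part of the job. In fact it is the easiest and smallest part. The most important part is careful observation and precise thinking, which is why technicians so often seem taciturn, even withdrawn while they are running a test.

As a person works on a motorcycle, his understanding of the machine changes minute by minute, and he comes to know it afresh in a way that contains more Quality. He is not bound by the traditional ways of doing things, because he has enough rational grounds to reject them. Reality is no longer static. It is not a set of ideas that you must either fight or give up on; these are ideas that grow as you grow. So the nature of reality with Quality in it is no longer static but explosive in its force, and once you understand this you will never stay stuck.

However badly you are stuck, it will eventually pass. Your mind will naturally find its way to a solution in the end, unless you are someone who gets stuck very easily. There is really no need to fear being stuck, because the longer you are stuck, the more clearly you see the Quality that gets you unstuck.
So stuckness should not be avoided. It is the state of mind that precedes real understanding. Accepting stuckness without ego is a key to understanding Quality, in technical work or anywhere else. Self-taught mechanics understand Quality better than those trained in institutions precisely because they get stuck so often, and so they know how to handle the unexpected.

(Continues from the passage on peace of mind in “High-level craftsmanship”.)
So when working on a motorcycle, the most important thing is to cultivate peace of mind, so that you are not alienated from your working surroundings; the same holds for any other work. Once you have that, everything else follows naturally. Peace of mind produces right values, right values produce right thoughts, right thoughts produce right actions, and work done through right actions lets others see the peace of mind at the center of it all.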

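Quality

Quality... you know what it is, yet you don't know what it is. That is self-contradictory. If some things are better than others, they rank higher. But as soon as you try to say what Quality is, apart from the things that have it, you cannot explain it at all, because there is nothing to talk about. Yet if you can't say what Quality is, how do you know what it is? How do you even know it exists? If nobody knows what it is, then for all practical purposes it doesn't exist, and yet in fact it does. What else are the rankings based on? Why else would people pay more for some things and throw others in the trash? Obviously some things are better than others, but what is “better”? ... Your thoughts go round and round and find no way out. What is Quality? What is it?

On Quality, Phaedrus's exploration had two main phases:
1) In the first, he did not try to set up a rigid, systematic definition, so this side of Quality was happy, fulfilling and creative. Most of his time teaching at the school in the valley behind us was like this.
2) Because people criticized him for failing to define what he was talking about, he then produced a systematic, rigid definition of Quality and built a vast system of thought on it. Having racked his brains to construct a systematic account of existence, he left us understanding it far better than before.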

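But how do you use reason to define something that refuses to be defined? Definition is the foundation of reason; where there is reason, there are definitions. He could hold off attack for a while with dialectical tactics and with insults about competence and incompetence, but sooner or later he had to come up with something more substantial to carry the crystallization forward, beyond the bounds of traditional rhetoric and into the realm of philosophy.

Explaining Quality philosophically is both right and wrong, because it is a philosophical explanation. The process of philosophical explanation is analysis: breaking something down into subjects and predicates. My point is that the word Quality cannot be broken down into subject and predicate, not because Quality is mysterious, but because it is so simple, immediate and direct a response.
To help people with our kind of background understand pure Quality, the most concise way to put it is: “Quality is the response of an organism to its environment.”

Quality is the Buddha. Quality is scientific reality. Quality is the goal of art. These ideas still have to be worked into everyday life, and the simplest way to do that is the one I have been talking about all along: fixing a motorcycle.

A technician, like a mathematician, is good or bad according to his ability to choose good from bad on the basis of Quality. So he has to know how to care.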

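Mountain climbing

To someone without discernment, ego-climbing and selfless climbing may look identical: both place one foot in front of the other, both breathe at the same rate, both stop when tired and go on again when rested. But how different they are! The ego-climber is like an instrument out of adjustment. His steps are either too fast or too slow, and he is likely to miss the beautiful sunlight in the treetops. He keeps going when his steps have become unsteady instead of resting. Sometimes he looks up the trail again when he has only just checked it. So his reactions to his surroundings are either too quick or too slow. His talk is always about something else and somewhere else. He is here, but his mind is not, because he refuses to live in the here and now. He wants to get to the top as fast as he can, but once he is there he is still unhappy, because the summit at once becomes “here”. What he seeks, what he wants, is all around him already, but he does not want it because it is right beside him. And so every step is an effort, physically and mentally, because he always imagines his goal to be far away.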

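High-level craftsmanship (Zen)

Peace of mind is no small matter in mechanical work; it is the heart of the work. What brings you peace is high-level craftsmanship, and what does the opposite is low-level.

The peace of mind I am talking about has no direct connection with external circumstances. A monk in meditation, a soldier under thundering shellfire, or a machinist making an adjustment to a ten-thousandth of an inch can each experience it. It involves an unselfconscious attitude that lets a person merge completely with his surroundings. There are many levels of this merging and many levels of peace, and the deeper your practice, the more you appreciate how profound and difficult it is.

There are three levels of peace of mind. Physical peace, although it too has many degrees, seems the easiest to attain. Mental peace, meaning the removal of one's wandering thoughts, is rather harder but still achievable. Peace of values, in which one has no grasping desires and simply lives one's life, seems to be the hardest.

Zen Buddhism advocates sitting meditation to bring a person to forget the division between self and object. In the motorcycle maintenance I have been describing, as long as you are absorbed in working on the machine, no opposition between self and object arises. Once you are truly involved in the work, you can be said to care about it. That is the real meaning of caring: a sense of identification with the work in your hands. When someone has this identification, he also sees the other side of caring, which is Quality.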

Links

https://www.briancoords.com/zen-art-motorcycle-maintenance-book-review-notes/
https://www.ruanyifeng.com/blog/2011/12/zen_and_the_art_of_motorcycle_maintenance.html

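Main text

According to the creation date of my notes, I bought this book almost nine months ago, and only finished reading it, in fits and starts, four months ago. The second half was a real struggle, packed with philosophical argument and stream-of-consciousness narration... For a typical engineering student like me who has never studied philosophy systematically, it was tiring, so I really do want to recommend the book to everyone and see whether some sage can help me make sense of it...
Honestly, I picked the book up purely because the title caught my eye (it felt oddly familiar). Pairing Zen with motorcycles is full of tension in itself. Going by how bikers are seen in China, motorcycles suggest people who chase individuality and are restless (for instance, the most-discussed topic in many motorcycle modification chat groups is how to modify the exhaust to roar through the streets), which is hard to reconcile with the calm, far-sighted feel of “Zen”. Later I realized that American motorcycle culture of the last century is probably quite different from today's in China. Cultural differences really can skew one's understanding at times.

The content of the book is genuinely messy, and it is a hard book to finish. While preparing for this session of the reading group, all I could roughly remember was that it is the story of a father and son who ride motorcycles across America with two friends, and that along the way the author starts exploring a theory of “Quality”. What follows is my summary after reading some reviews and excerpts on Douban.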

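Let me first sketch the overall arc of the book.

The book can be divided into four parts.
Part one covers the journey of the author's party from Minneapolis to Miles City, Montana. The party includes the author's son Chris and his companions, John and his wife. Part one is fairly easy to follow and reads rather like a road movie: four people, two motorcycles, speeding across open country in all weather. Here the author mainly writes about his reflections on motorcycle travel.

Along the way, because of altitude or weather, the motorcycles often develop faults, such as the loose handlebars on John's bike or the ignition trouble on the author's own. Faced with the same kind of problem, John and the author respond in completely different ways. The author holds that in the course of a repair, by listening closely to the engine, watching how it runs, and thinking through solutions, one can reach a state of deep “merging” with the object; it is an art in which you can experience flow and enter a Zen state, which sounds very classical. John, by contrast, has only a “romantic understanding” of motorcycles; he thinks he should not get his hands dirty and that specialist work should be left to a professional mechanic. This conflict takes up a large share of part one.
Here is one example from the book:
The author finds that the handlebars on John's motorcycle are loose, and tightening them with a wrench does not help. So he suggests shimming them with a thin strip of metal slipped into the gap at the handlebars so that they hold tighter. John asks, “Good. Where do you buy them?” The author picks up a beer can and says they can cut one straight from it; he considers it the best shim stock in the world. John is furious that the author would dare use a strip of beer can to fix his brand-new eighteen-hundred-dollar BMW! To him, the bike stands for half a century of German mechanical finesse. In the end John does not take the suggestion, while the author maintains that any truly first-rate German mechanic would consider this solution hard to better.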

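In his later analysis, the author concludes that the difference between him and his companion John (the representative of the “anti-technology” view) comes from their different starting points and perspectives. The two of them stand, respectively, for the perspective of reason and knowledge (underlying meaning) and the perspective of intuition and the present moment (surface appearance): “one is the artistic expression you sense immediately, the other is the scientific principle hidden within it.” He argues that technology creates a conflict in how people perceive reality, one that some people cannot face. This leads the author into a discussion of a “dichotomy”. He holds that this dichotomy, between classical understanding and romantic understanding, is the root of the problem. Romantic understanding is chiefly a matter of inspiration, imagination, creativity and intuition. Classical understanding tends to rely on reason and laws. So although traveling by motorcycle is romantic, repairing one is purely classical. People tend to think and feel predominantly in one of these modes, and to misunderstand and look down on the other. The author believes that what lies behind technology, behind all of modern science and Western thought, is reason itself. And that is the theme he wants to explore on this trip.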

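In addition, the author notes that if you take your bike to a professional shop, the mechanics often fix the fault you came in with but carelessly break something else. They are not afraid of technology the way John and Sylvia are; they are specialists, yet they go about the work like chimpanzees, without really being involved.
As evidence of this lack of involvement, the author offers several clues, such as the radio and the speed of their movements. The most important clue is the expression on their faces. It is hard to explain, the author says: they seem easygoing, friendly and relaxed, yet they are not involved in the work. They are like spectators; you get the feeling they are just wandering around and have had a wrench handed to them. They do not identify with the job; they would not say, “I am a mechanic.” You know that at 5 p.m., the moment their eight hours are up, they will drop what they are doing, leave at once, and think about work as little as possible. In this respect they are like John and Sylvia: they want to use the fruits of technology but want nothing to do with it. Or rather, they do have a relationship with it, but they do not commit themselves to it and stay cool and detached; they take part in the work without truly caring about it.

Beyond that, the people who write motorcycle maintenance manuals are in much the same position. When the author was learning to repair motorcycles he turned to the manuals for reference and found them a mess. This shows that the manual writers, like the careless mechanics, were working with a spectator's attitude. The author therefore concludes that they do not identify with their work, and without that identification they cannot truly care about what they are working on.

Caring about the object of one's work is, I think, one of the most important themes in the book. The author describes the opposite of caring, namely hurry, as follows:
He says that hurry is itself the most objectionable attitude of the twentieth century. When you do something and want to get it done fast, it means you no longer care about it and want to move on to something else.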

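So what is the right attitude to motorcycle repair? In the author's view, repairing a motorcycle is largely a chain of reasoning. An experienced mechanic does not rigidly follow the manual; he makes choices as he goes, which brings thought into the work. That demands full concentration and a natural harmony between the movements of the hands and the state of the machine (that is, a feel for it). The mechanic adjusts his thinking by feel and decides the next move, and throughout the repair both the machine and his thinking keep changing. The process resembles the making of a work of art.
So when working on a motorcycle, the most important thing is to cultivate peace of mind, so that you are not alienated from your working surroundings; the same holds for any other work, where you likewise need to care about what you are doing. Once you have that, everything else follows naturally. Peace of mind produces right values, right values produce right thoughts, right thoughts produce right actions, and work done through right actions lets others see the peace of mind at the center of it all.

In this part the author introduces the book's key figure, Phaedrus. He is a ghost, and at this point the author describes him only as a fanatical hunter in pursuit of reason.

Part two covers the party's journey from Miles City, Montana, to Bozeman. From here on the author alternates between the motorcycle trip and Phaedrus's life story. This part is quite scattered. The author advances many ideas, for instance that the art of motorcycle maintenance is a miniature study of the art of rationality. I won't go into them here; the material is fairly abstract. The rest is mostly the life story of the ghost Phaedrus: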

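Phaedrus had completed a first-year university science course by the age of fifteen. One of the questions that occupied him was how to understand Einstein's statement that universal elementary laws can be reached only by intuition resting on a deep understanding of experience, and not by logical deduction. Phaedrus held that the lifespan of a scientific truth is an inverse function of the intensity of scientific effort, and that the main cause of this shortening lifespan is the proliferation of hypotheses. Science leads people away from a single absolute truth toward a plural, shifting, relative world, and is the chief culprit behind social chaos and confused values. On this basis Phaedrus mounted a critique of reason: the traditional structure of reason no longer meets the needs of the age; in essence it is emotionally hollow, aesthetically meaningless and spiritually empty. From this point on, Phaedrus realized that the truth he had been pursuing was a lateral truth, not the frontal truth of science. Meanwhile he took a post at a university and tried to teach rhetoric in the spirit of a “Church of Reason”, believing that the essence of a university lies in the reason it hands down. During this period he followed reason upstream through time, reading one philosophical work after another and encountering one system of thought after another, until he ran into the dichotomy.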

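The so-called dichotomy divides reality, or phenomena, into two parts: the classical and the romantic. The classical corresponds to the reason that now governs human civilization: analytical, seamless reason that leaves no room for surprise. The romantic is irrational, emotional and unsystematic. Before modern civilization, human society was irrational, religious and ignorant; it was only because Greek philosophers such as Plato and Aristotle extracted the classical that truth could shine, which in turn gave rise to a modern civilization founded on technology.

Phaedrus lingered before this dichotomy. He found that this foundation stone, which had persuaded countless people, was extremely fragile, and not merely fragile but sinister, because it had buried something eternal.
So he went mad. He set out to smash the foundation stone of modern civilization, going so far as to reject the intellectual legacy of generations of sages. He not only took down Aristotle and Plato but also charged at Socrates, taking the side opposed to him and fighting shoulder to shoulder with the rhetoricians buried in the dusty archives. He was out to overturn the whole of classical Greek civilization and, with it, the whole of modern civilization. But tearing down requires building something in its place, and for this Phaedrus tentatively proposed the concept of Quality.

So what exactly is Quality? Phaedrus gave no explicit definition, because in his thinking Quality stands above the existing philosophical system. In a sense Quality is the source of all things, the father of all things; and being the father, how could it be fenced in (defined) by the frameworks of its sons and grandsons? What is more, it is not only the source; it is also all things.
This means Quality cannot be grasped by reason alone. If it must be described in the concepts of the younger generation, Quality is the utmost truth, the utmost goodness and the utmost beauty: the highest truth, goodness and beauty, which one senses naturally without teaching or guidance.
Lesser truths, such as the laws of physics and mathematics, need proofs and assumptions; Quality does not, and simply exists a priori. Lesser goods, such as giving up your seat or filial piety, need instilling and guidance; Quality is entirely natural action. As for beauty that can be appreciated only through interpretation, that cannot be compared with it at all.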

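Pure concepts like these are hard to follow. In part three of the book (which begins with the author and his son Chris climbing a mountain and ends with the ride to Bend, in the West Coast state of Oregon), the author gives some examples of Phaedrus exploring and thinking about Quality through teaching rhetoric:

A student could not write a short essay about her hometown. Phaedrus suggested that she narrow it down: from her hometown to “one building on the main street”, and then to “one brick of that building”. Suddenly the words poured out. From this Phaedrus concluded that the difficulty in writing came not from a lack of knowledge but from a blockage in the perception of Quality. Here Quality is first revealed as a kind of “bright spot in experience” or “inner sense of rightness”.
In his teaching Phaedrus found that he could not “define” a good essay by any standard grading scheme, yet he could tell by intuition which students' essays were “good” and which were “bad”. This prompted deeper thought about Quality: it is not a purely subjective preference, nor is it an attribute measurable by objective rules. Quality exists prior to the division into subject and object and is the source of all value judgments.
In the later stage of his teaching, Phaedrus began encouraging students to discard rigid devices such as the “topic sentence” and the “syllogism”, to attend to their “real response” to a subject, and to write down “what they truly thought was important”. This practice reflected his faith in Quality. He believed everyone has an innate ability to perceive Quality, and that the task of education is not to pass on techniques but to awaken that perception.

After that (skipping some reflections on the climb) come some discussions of Quality itself.
Phaedrus took a different view from “dualism”: he held that the world is a trinity of mind, matter and Quality. Quality is the event that occurs when the subject becomes aware of the object. The author goes further: both technologists and anti-technologists lack caring; and if caring and Quality are two sides of the same thing, it follows that the fundamental problem with technology today is that both groups lack the ability to see Quality in technology. Phaedrus's fanatical pursuit of what the word Quality means in rational, analytical and technical terms was really a search for an answer to the fundamental problem of technology, and behind it lies the whole question of the relationship between humanity and technology. The real ugliness lies in the relationship between the people who produce technology and the things they produce, and the same holds for the relationship between the people who use technology and the things they use. The author gives examples of Quality: whether you are building a factory, fixing a motorcycle or running a country, you have to have some feeling for the quality of the work, and you have to be able to judge what is good; that is what moves you forward. Quality, which keeps changing, is reality. To stay close to Quality one must avoid value rigidity, keep one's “enthusiasm”, and keep re-examining whether what one used to think important still is. Avoid “ego”, “anxiety” and “boredom”. First make yourself perfect, and then just paint naturally: that is how all the experts do it. When we work on the motorcycle, we are really working on “ourselves”.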

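The fourth and final part covers the journey after the author reaches the West Coast and the “ending” of Phaedrus's story. It then resolves the book's biggest mystery: who Phaedrus actually is. The book reveals that Phaedrus is the author himself before he underwent psychiatric treatment. During his doctoral studies at the University of Chicago he had enormous hopes for his research, but his position was directly opposed to that of the chairman. This put him through violent inner conflict and struggle during his studies, and in the end it led him to give it all up and brought on schizophrenia and depression.

That is roughly the content of the whole book. I have probably left out quite a lot, such as the relationship between Chris and the author and the reflections during the climb... I was honestly exhausted by the time I had summarized this much... Everyone is welcome to fill those in later.
As for my own reaction: when I first came across the book's notion of Quality I found it baffling. I could not see how a concept that cannot be defined could be used to challenge the foundation on which rational society is built. After finishing the book, though, I do feel that everyone has, deep down, the ability to perceive Quality, and that reading it woke that ability in me to a large degree. It has also given me more strength and courage to follow my inner preferences. If I had to put it in a sentence, both research and life now have a sense of direction, and I have regained a calm I had not felt in a long time.

With that calm, a person is no longer alienated from the work he does and can identify with it. Even when boredom sets in at some stage (which is unavoidable), he can still bring himself back to a positive frame of mind, listen to the guidance of his inner sense of Quality, follow it, and turn the work in his hands into an art. He becomes a more interesting person as a result, and his relationships with others stop being objectifying ones. Quality also spreads through him into other people's minds, and those who feel it pass it on, so Quality keeps propagating.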

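I once came across a review on Douban that I thought was especially well written:

Broadly speaking there are two kinds of books. One kind contains information you simply did not know; reading it is like riding freely across a vast grassland of information, with novelty wherever you go. The other kind is a continent you already know, only not clearly; it lies faintly in the back of your mind, and reading this kind of book helps you step into that shadow and approach it with certainty.

The first kind of book is excellent. It gives you information and knowledge you did not have, and when you reach the edge of the grassland it is time to say goodbye, because it has been stored in your brain as data.

The second kind is outstanding. It gives you nothing new, but it can wipe clean what is already inside you and let you rediscover the treasures lying deep within: truth, for instance, or goodness, or beauty, treasures you long ago left forgotten in a corner.

Fortunately, with a book of the second kind you can always wipe away the grime again and recover these treasures from the mud.

This book belongs to the second kind. It helps you bring Quality into sharper focus. It is like a cleaning cloth: each time you use it, Quality shines a little clearer.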

Posted 2025-05-23Updated 2025-12-07Note3 minutes read (About 435 words)

Pixtral 12B API Inference

Repository:
https://github.com/PSGBOT/pixtral-12B-Inference

Local image upload

import base64

def encode_image(image_path):
    """Encode the image at image_path as a base64 string; return None on failure."""
    try:
        with open(image_path, "rb") as image_file:
            return base64.b64encode(image_file.read()).decode('utf-8')
    except FileNotFoundError:
        print(f"Error: The file {image_path} was not found.")
        return None
    except Exception as e:  # Any other read or encoding failure
        print(f"Error: {e}")
        return None
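To show where the encoded string might go next, here is a minimal sketch that wraps it in a data URL inside a chat message. The helper name `build_image_message`, the default JPEG MIME type and the example prompt are assumptions made for illustration and are not taken from the repository; the message layout follows the `image_url` content format described in Mistral's vision documentation, so it is worth checking against the current API reference before relying on it.

```python
import base64
from typing import Optional


def encode_image(image_path: str) -> Optional[str]:
    """Return the file's bytes as a base64 string, or None if it cannot be read."""
    try:
        with open(image_path, "rb") as image_file:
            return base64.b64encode(image_file.read()).decode("utf-8")
    except OSError as e:
        print(f"Error: {e}")
        return None


def build_image_message(image_path: str, prompt: str,
                        mime: str = "image/jpeg") -> Optional[list]:
    """Hypothetical helper: one user turn holding the prompt and an inline image."""
    b64 = encode_image(image_path)
    if b64 is None:
        return None
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                # The image travels inline as a data URL rather than a hosted link.
                {"type": "image_url", "image_url": f"data:{mime};base64,{b64}"},
            ],
        }
    ]
```

The returned list could then be passed as the `messages` argument of a chat call. Returning `None` on a read failure keeps the caller from sending a request with an empty image.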

Prompt

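Prompt for VLM object description:

Core requirements: locate the object accurately, avoid treating distant background as an object, and reduce false positives.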

Focus on the area highlighted in green in the image.

Step 1: Determine if the highlighted area represents a distinct, identifiable object or instance:
- If the highlighted area is clearly a distinct object, proceed to Step 2.
- If the highlighted area is abstract, ambiguous, or you cannot confidently identify it as a specific object (e.g., part of background, texture, partial view), respond with "Valid: No".

Step 2: If the highlighted area is a distinct object, provide:
1. The specific name of the object (be precise and use technical terms when appropriate)
2. The primary function or purpose of this object
3. Any notable features visible in the highlighted area (no color description)
4. If there is text visible on the object, include what it says

Remember, if you're uncertain about the highlighted area being a distinct object, respond only with "Valid: No".

Output:

Valid

Valid: Yes

1. The specific name of the object: Soap dispenser
2. The primary function or purpose of this object: To dispense liquid soap or hand sanitizer.
3. Notable features visible in the highlighted area:
	- The dispenser has a pump mechanism at the top.
	- The body of the dispenser is cylindrical.
	- The material appears to be translucent plastic.
4. There is no visible text on the object.

Invalid

Valid: No
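Since the two reply shapes above are quite regular, a cheap local pre-check can separate them before any further processing. The sketch below is an assumption-laden illustration, not code from the repository: `split_vlm_reply` is a hypothetical name, and it assumes the model sticks to the `Valid: No` line and the numbered-list layout requested in the prompt.

```python
import re
from typing import Dict, List, Optional


def split_vlm_reply(reply: str) -> Optional[Dict[str, object]]:
    """Hypothetical helper: None for "Valid: No", else numbered items and their bullets."""
    text = reply.strip()
    # A line reading "Valid: No" means the highlighted region was rejected.
    if re.search(r"^Valid:\s*No\b", text, flags=re.IGNORECASE | re.MULTILINE):
        return None
    items: List[str] = []
    bullets: Dict[int, List[str]] = {}
    for line in text.splitlines():
        stripped = line.strip()
        numbered = re.match(r"^(\d+)\.\s+(.*)$", stripped)
        if numbered:
            items.append(numbered.group(2))
        elif stripped.startswith("- ") and items:
            # Attach each bullet to the most recent numbered item (1-based index).
            bullets.setdefault(len(items), []).append(stripped[2:])
    return {"items": items, "bullets": bullets}
```

Rejected regions can then be dropped without a second model call, and anything that does not match the expected layout simply comes back with an empty `items` list.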

VLM output -> Structured Output

A second LLM parses the VLM output and converts it to JSON, using the structured-output interface provided by Mistral AI:

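from typing import List, Optional

from pydantic import BaseModel, Field

# Schema the parsing model fills in for each highlighted region.
class Instance(BaseModel):
    valid: str
    name: Optional[str] = None
    feature: Optional[List[str]] = Field(default_factory=list)
    usage: Optional[List[str]] = Field(default_factory=list)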

def parse_description_msg(msg):
    message = [
        {"role": "system", "content": "Extract the description information."},
        {
            "role": "user",
            "content": msg,
        },
    ]
    return message

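import json

# Body of a method on the inference class; the method name and signature here are illustrative.
def parse_description(self, vlm_output):
    msg = parse_description_msg(vlm_output)
    chat_response = self.client.chat.parse(
        model=self.llm,
        messages=msg,
        response_format=Instance,
        max_tokens=self.llm_max_tokens,
        temperature=self.llm_temperature,
    )
    return json.loads(chat_response.choices[0].message.content)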
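The final `json.loads` call assumes the model always returns a well-formed object. As a hedged sketch of a more defensive variant, the helper below falls back to an "invalid" record when decoding fails and fills in the same defaults as the `Instance` schema. The name `load_instance` and the choice of `"No"` as the fallback value are assumptions for illustration only; it uses just the standard library so it can be tried without the Mistral client.

```python
import json
from typing import Any, Dict

# Fallback record mirroring the Instance fields (valid, name, feature, usage).
_EMPTY: Dict[str, Any] = {"valid": "No", "name": None, "feature": [], "usage": []}


def load_instance(content: str) -> Dict[str, Any]:
    """Hypothetical helper: decode the model's JSON and normalise missing fields."""
    try:
        data = json.loads(content)
    except json.JSONDecodeError:
        return dict(_EMPTY)
    if not isinstance(data, dict):
        return dict(_EMPTY)
    return {
        "valid": str(data.get("valid", "No")),
        "name": data.get("name"),
        # Treat null or missing lists as empty so callers can iterate safely.
        "feature": list(data.get("feature") or []),
        "usage": list(data.get("usage") or []),
    }
```

Whether a malformed reply should count as invalid or be retried is a design choice; this sketch takes the conservative route of discarding it.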
