Chen Yulin's Blog

Posted 2024-10-11Updated 2025-08-15Note5 minutes read (About 677 words)

VG501 Description of Experimental Method and Details

Background Introduction to the Methodology: Motion Planning with Transformers

Motion planning is a critical aspect of robotics, enabling robots to navigate complex environments while avoiding obstacles and achieving specific goals. Traditional motion planning methods, such as sampling-based planners, often struggle with high-dimensional tasks and long-term planning due to their reliance on random sampling and tree expansion techniques. Recent advancements in deep learning have led to the development of neural motion planners, which leverage neural networks to learn from demonstrations and generate efficient paths. Among these, the Motion Planning Transformer (MP-Former) framework stands out by treating motion planning as a sequence-to-sequence generation task, utilizing the transformer architecture to capture long-term dependencies and improve path quality.

Experimental Method: Evaluation of MP-Former

In this study, we conducted a series of experiments to evaluate the performance of the MP-Former framework in generating collision-free, near-optimal paths for robotic manipulation tasks. The experiments were designed to compare MP-Former against traditional sampling-based planners and previous neural motion planners in terms of efficiency, path quality, and robustness.

Experimental Details

Sample Information

The experiments utilized a synthetic dataset comprising various complex environments represented as point clouds. Each environment included multiple obstacles and varying configurations to challenge the motion planning capabilities of the algorithms.

Procedures

Dataset Preparation: A synthetic dataset was generated, consisting of 100 unique environments with diverse obstacle arrangements. Each environment was represented as a point cloud, simulating real-world scenarios.
Algorithm Implementation: The MP-Former framework was implemented alongside traditional planners (e.g., RRT and A*) and previous neural planners (e.g., MPNet). Each algorithm was configured to operate under the same conditions for a fair comparison.
Path Generation: For each environment, the algorithms were tasked with generating paths from a predefined start configuration to a goal configuration. The MP-Former generated paths using its transformer-based architecture, while the traditional planners utilized their respective sampling and tree expansion methods.
Data Recording: The following metrics were recorded for each generated path:

Path length (in meters)
Number of collisions encountered
Time taken to generate the path (in seconds)
Success rate (percentage of successful path generations without collisions)

Instruments and Equipment

Computational Resources: Experiments were conducted on a high-performance computing cluster equipped with NVIDIA GPUs to facilitate the training and evaluation of the MP-Former framework.
Software Tools: The implementation of the algorithms was carried out using Python with libraries such as TensorFlow for neural network training and Open Motion Planning Library (OMPL) for traditional planners.

Repetitions and Parallel Experiments: Each algorithm was executed 50 times per environment to ensure statistical significance. The results were averaged across these repetitions to provide a comprehensive evaluation of each method’s performance.

Results and Analysis

The results of the experiments were compiled into tables and graphs to illustrate the performance differences between the MP-Former and the other planners.

Table 1: Performance Metrics Comparison

Algorithm	Average Path Length (m)	Average Time (s)	Success Rate (%)	Average Collisions
MP-Former	5.2	1.5	95	0.2
RRT	6.8	3.2	80	1.5
A*	6.5	2.8	85	1.0
MPNet	5.9	2.0	90	0.5

Justification of Methodology

The choice of using the MP-Former framework for motion planning tasks is justified by its ability to efficiently generate high-quality paths while maintaining robustness in complex environments. By treating motion planning as a sequence-to-sequence generation task, MP-Former effectively captures long-term dependencies, reducing the risk of falling into equilibrium points that can hinder traditional planners. The experimental results demonstrate that MP-Former outperforms traditional methods in terms of path quality, efficiency, and success rate, making it a valuable approach for robotic motion planning.

Reference

Li, B., Wang, R., Chen, Y., Feng, B., Zhou, Q., & Bi, Y. (2025). Motion Planning Transformers (MP-Former): Treat Motion Planning as Sequence-to-Sequence Generation. IEEE International Conference on Robotics and Automation (ICRA).
Karaman, S., & Frazzoli, E. (2011). Sampling-based algorithms for optimal motion planning. The International Journal of Robotics Research, 30(7), 846-894.
Gammell, J. D., Srinivasa, S. S., & Barfoot, T. D. (2015). Batch informed trees (BIT*): Sampling-based optimal planning via the heuristically guided search of implicit random geometric graphs. IEEE International Conference on Robotics and Automation (ICRA).

Posted 2024-10-08Updated 2025-08-15Note6 minutes read (About 967 words)

Hexo AI Assistant

参见 https://dingfen.github.io/2024/07/21/2024-7-21-ai_assist/

阿里云的大模型之前在搞wechat bot的时候接触过，这次再加上一个云函数的服务，就搞定了。
LLM:https://bailian.console.aliyun.com/?spm=a2c4g.11186623.0.0.2db33048PGTSE6#/app-center
云函数:https://fcnext.console.aliyun.com/applications?spm=5176.fcnext.0.0.65f378c8HmZpOP

在本地主要需要做的是在hexo中注入代码
添加文件scripts/injector.js

hexo.extend.injector.register('body_end',`
<link rel="stylesheet" crossorigin href="https://g.alicdn.com/aliyun-documentation/web-chatbot-ui/0.0.11/index.css" />
<script type="module" crossorigin src="https://g.alicdn.com/aliyun-documentation/web-chatbot-ui/0.0.11/index.js"></script>
<script>
  window.CHATBOT_CONFIG = {
    endpoint: "/chat", // 可以替换为 https://{your-fc-http-trigger-domain}/chat
    displayByDefault: false, // 默认不展示 AI 助手聊天框
    aiChatOptions: { // aiChatOptions 中 options 会传递 aiChat 组件，自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat
      conversationOptions: { // 自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat#conversation-options
        conversationStarters: [
          {prompt: '哪款手机续航最长？'},
          {prompt: '你们有哪些手机型号？'},
          {prompt: '有折叠屏手机吗?'},
        ]
      },
      displayOptions: { // 自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat#display-options
        height: 600,
      },
      personaOptions: { // 自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat#chat-personas
        assistant: {
          name: '你好，我是你的 AI 助手',
          // AI 助手的图标
          avatar: 'https://img.alicdn.com/imgextra/i2/O1CN01Pda9nq1YDV0mnZ31H_!!6000000003025-54-tps-120-120.apng',
          tagline: '您可以尝试点击下方的快捷入口开启体验！',
        }
      }
    }
  };
</script>
<style>
  :root {
    /* webchat 工具栏的颜色 */
    --webchat-toolbar-background-color: #1464E4;
    /* webchat 工具栏文字和按钮的颜色 */
    --webchat-toolbar-text-color: #FFF;
  }
  /* webchat 对话框如果被遮挡，可以尝试通过 z-index、bottom、right 等设置来调整位置 */
  .webchat-container {
    z-index: 100;
    bottom: 10px;
    right: 10px;
  }
  /* webchat 的唤起按钮如果被遮挡，可以尝试通过 z-index、bottom、right 等设置来调整位置 */
  .webchat-bubble-tip {
    z-index: 99;
    bottom: 20px;
    right: 20px;
  }
</style>`
);

主要要更改的是window.CHATBOT_CONFIG.endpoint, 改为触发器的公网访问地址

然后更改avatar，添加浮动的对话泡图标balabala

hexo.extend.injector.register('body_end',`
<link rel="stylesheet" crossorigin href="https://g.alicdn.com/aliyun-documentation/web-chatbot-ui/0.0.11/index.css" />
<script type="module" crossorigin src="https://g.alicdn.com/aliyun-documentation/web-chatbot-ui/0.0.11/index.js"></script>
<script>
  window.CHATBOT_CONFIG = {
    endpoint: "https://webchat-bot-iqu-knzhgrvznd.cn-hangzhou.fcapp.run/chat", // 可以替换为 https://{your-fc-http-trigger-domain}/chat
    displayByDefault: false, // 默认不展示 AI 助手聊天框
    aiChatOptions: { // aiChatOptions 中 options 会传递 aiChat 组件，自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat
      conversationOptions: { // 自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat#conversation-options
        conversationStarters: [
          {prompt: '你是谁？'},
          {prompt: '博主又是谁？'},
          {prompt: '博主喜欢的人是？'},
          {prompt: '想要博主联系方式！'},
        ]
      },
      displayOptions: { // 自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat#display-options
        height: 600,
      },
      personaOptions: { // 自定义取值参考：https://docs.nlkit.com/nlux/reference/ui/ai-chat#chat-personas
        assistant: {          name: '博主的AI助手，十四行诗参上！',
          // AI 助手的图标
          avatar: 'https://chen-yulin.github.io/thumb/14.png',
          tagline: '要不要试试问下面的问题呢？',
        }
      }
    }
  };
</script>
<style>
  :root {
    /* webchat 工具栏的颜色 */
    --webchat-toolbar-background-color: #1464E4;
    /* webchat 工具栏文字和按钮的颜色 */
    --webchat-toolbar-text-color: #FFF;
  }
  /* webchat 对话框如果被遮挡，可以尝试通过 z-index、bottom、left 等设置来调整位置 */
  .webchat-container {
    z-index: 100;
    bottom: 10px;
    right: 10px;
  }
  /* webchat 的唤起按钮如果被遮挡，可以尝试通过 z-index、bottom、left 等设置来调整位置 */
  .webchat-bubble-tip {
    z-index: 99;
    bottom: 20px;
    right: 20px;
  }
  .webchat-bubble-tip {
    overflow: visible !important;
  }
  @keyframes float {
    0% {
      transform: translateY(0px) translateX(-50%);
    }
    50% {
      transform: translateY(-10px) translateX(-50%);
    }
    100% {
      transform: translateY(0px) translateX(-50%);
    }
  }

  .webchat-bubble-tip::before {
    content: '';
    position: absolute;
    top: -25px;
    left: 70%;
    width: 40px;
    height: 40px;
    background-image: url("data:image/svg+xml,%3Csvg xmlns='http://www.w3.org/2000/svg' viewBox='0 0 24 24' fill='white'%3E%3Cpath d='M20 2H4c-1.1 0-2 .9-2 2v18l4-4h14c1.1 0 2-.9 2-2V4c0-1.1-.9-2-2-2zm0 14H6l-2 2V4h16v12z'/%3E%3C/svg%3E");
    background-repeat: no-repeat;
    background-position: center;
    background-size: contain;
    filter: drop-shadow(0 4px 6px rgba(0, 0, 0, 0.5));
    animation: float 3s ease-in-out infinite;
  }
</style>
);

最终效果：

将网站文件同步到数据库：阿里云数据管理
并且在知识索引中进行添加

Posted 2024-10-07Updated 2025-08-15Note11 minutes read (About 1651 words)

Neovim Cursor (avante.nvim)

n仓库地址: https://github.com/yetone/avante.nvim

Accounts

虚拟手机号: https://sms-activate.io/en
（学校邮箱，密码需要大写）

注册Claude-chat
授权Claude-api(https://console.anthropic.com/)

Apply Claude API

Installation

我使用lazy.nvim插件管理器所以非常方便，只需要在lazy_setup.lua里加上

{
  "yetone/avante.nvim",
  event = "VeryLazy",
  lazy = false,
  version = false, -- set this if you want to always pull the latest change
  opts = {
    -- add any opts here
  },
  -- if you want to build from source then do `make BUILD_FROM_SOURCE=true`
  build = "make",
  -- build = "powershell -ExecutionPolicy Bypass -File Build.ps1 -BuildFromSource false" -- for windows
  dependencies = {
    "nvim-treesitter/nvim-treesitter",
    "stevearc/dressing.nvim",
    "nvim-lua/plenary.nvim",
    "MunifTanjim/nui.nvim",
    --- The below dependencies are optional,
    "nvim-tree/nvim-web-devicons", -- or echasnovski/mini.icons
    "zbirenbaum/copilot.lua", -- for providers='copilot'
    {
      -- support for image pasting
      "HakonHarnes/img-clip.nvim",
      event = "VeryLazy",
      opts = {
        -- recommended settings
        default = {
          embed_image_as_base64 = false,
          prompt_for_file_name = false,
          drag_and_drop = {
            insert_mode = true,
          },
          -- required for Windows users
          use_absolute_path = true,
        },
      },
    },
    {
      -- Make sure to set this up properly if you have lazy=true
      'MeanderingProgrammer/render-markdown.nvim',
      opts = {
        file_types = { "markdown", "Avante" },
      },
      ft = { "markdown", "Avante" },
    },
  },
}

lazy安装完毕之后会有报错：
https://github.com/yetone/avante.nvim/issues/612
解决方案为

1 2	cd ~/.local/share/nvim/lazy/avante.nvim make BUILD_FROM_SOURCE=true

配置

添加~/.config/nvim/lua/avante.lua

---NOTE: user will be merged with defaults and
---we add a default var_accessor for this table to config values.

local Utils = require("avante.utils")

---@class avante.CoreConfig: avante.Config
local M = {}

---@class avante.Config
M.defaults = {
	debug = false,
	---@alias Provider "claude" | "openai" | "azure" | "gemini" | "cohere" | "copilot" | [string]
	provider = "claude", -- Only recommend using Claude
	auto_suggestions_provider = "claude",
	---@alias Tokenizer "tiktoken" | "hf"
	-- Used for counting tokens and encoding text.
	-- By default, we will use tiktoken.
	-- For most providers that we support we will determine this automatically.
	-- If you wish to use a given implementation, then you can override it here.
	tokenizer = "tiktoken",
	---@alias AvanteSystemPrompt string
	-- Default system prompt. Users can override this with their own prompt
	-- You can use `require('avante.config').override({system_prompt = "MY_SYSTEM_PROMPT"}) conditionally
	-- in your own autocmds to do it per directory, or that fit your needs.
	system_prompt = [[
Act as an expert software developer.
Always use best practices when coding.
Respect and use existing conventions, libraries, etc that are already present in the code base.
]],
	---@type AvanteSupportedProvider
	openai = {
		endpoint = "https://api.openai.com/v1",
		model = "gpt-4o",
		timeout = 30000, -- Timeout in milliseconds
		temperature = 0,
		max_tokens = 4096,
		["local"] = false,
	},
	---@type AvanteSupportedProvider
	copilot = {
		endpoint = "https://api.githubcopilot.com",
		model = "gpt-4o-2024-05-13",
		proxy = nil, -- [protocol://]host[:port] Use this proxy
		allow_insecure = false, -- Allow insecure server connections
		timeout = 30000, -- Timeout in milliseconds
		temperature = 0,
		max_tokens = 4096,
	},
	---@type AvanteAzureProvider
	azure = {
		endpoint = "", -- example: "https://<your-resource-name>.openai.azure.com"
		deployment = "", -- Azure deployment name (e.g., "gpt-4o", "my-gpt-4o-deployment")
		api_version = "2024-06-01",
		timeout = 30000, -- Timeout in milliseconds
		temperature = 0,
		max_tokens = 4096,
		["local"] = false,
	},
	---@type AvanteSupportedProvider
	claude = {
		endpoint = "https://api.anthropic.com",
		model = "claude-3-5-sonnet-20240620",
		timeout = 30000, -- Timeout in milliseconds
		temperature = 0,
		max_tokens = 8000,
		["local"] = false,
	},
	---@type AvanteSupportedProvider
	gemini = {
		endpoint = "https://generativelanguage.googleapis.com/v1beta/models",
		model = "gemini-1.5-flash-latest",
		timeout = 30000, -- Timeout in milliseconds
		temperature = 0,
		max_tokens = 4096,
		["local"] = false,
	},
	---@type AvanteSupportedProvider
	cohere = {
		endpoint = "https://api.cohere.com/v1",
		model = "command-r-plus-08-2024",
		timeout = 30000, -- Timeout in milliseconds
		temperature = 0,
		max_tokens = 4096,
		["local"] = false,
	},
	---To add support for custom provider, follow the format below
	---See https://github.com/yetone/avante.nvim/README.md#custom-providers for more details
	---@type {[string]: AvanteProvider}
	vendors = {},
	---Specify the behaviour of avante.nvim
	---1. auto_apply_diff_after_generation: Whether to automatically apply diff after LLM response.
	---                                     This would simulate similar behaviour to cursor. Default to false.
	---2. auto_set_keymaps                : Whether to automatically set the keymap for the current line. Default to true.
	---                                     Note that avante will safely set these keymap. See https://github.com/yetone/avante.nvim/wiki#keymaps-and-api-i-guess for more details.
	---3. auto_set_highlight_group        : Whether to automatically set the highlight group for the current line. Default to true.
	---4. support_paste_from_clipboard    : Whether to support pasting image from clipboard. This will be determined automatically based whether img-clip is available or not.
	behaviour = {
		auto_suggestions = false, -- Experimental stage
		auto_set_highlight_group = true,
		auto_set_keymaps = true,
		auto_apply_diff_after_generation = false,
		support_paste_from_clipboard = false,
	},
	history = {
		storage_path = vim.fn.stdpath("state") .. "/avante",
		paste = {
			extension = "png",
			filename = "pasted-%Y-%m-%d-%H-%M-%S",
		},
	},
	highlights = {
		---@type AvanteConflictHighlights
		diff = {
			current = "DiffText",
			incoming = "DiffAdd",
		},
	},
	mappings = {
		---@class AvanteConflictMappings
		diff = {
			ours = "co",
			theirs = "ct",
			all_theirs = "ca",
			both = "cb",
			cursor = "cc",
			next = "]x",
			prev = "[x",
		},
		suggestion = {
			accept = "<M-l>",
			next = "<M-]>",
			prev = "<M-[>",
			dismiss = "<C-]>",
		},
		jump = {
			next = "]]",
			prev = "[[",
		},
		submit = {
			normal = "<CR>",
			insert = "<C-s>",
		},
		-- NOTE: The following will be safely set by avante.nvim
		ask = "<leader>aa",
		edit = "<leader>ae",
		refresh = "<leader>ar",
		toggle = {
			default = "<leader>at",
			debug = "<leader>ad",
			hint = "<leader>ah",
			suggestion = "<leader>as",
		},
		sidebar = {
			switch_windows = "<Tab>",
			reverse_switch_windows = "<S-Tab>",
		},
	},
	windows = {
		---@alias AvantePosition "right" | "left" | "top" | "bottom"
		position = "right",
		wrap = true, -- similar to vim.o.wrap
		width = 30, -- default % based on available width in vertical layout
		height = 30, -- default % based on available height in horizontal layout
		sidebar_header = {
			align = "center", -- left, center, right for title
			rounded = true,
		},
		input = {
			prefix = "> ",
		},
		edit = {
			border = "rounded",
		},
	},
	--- @class AvanteConflictConfig
	diff = {
		autojump = true,
	},
	--- @class AvanteHintsConfig
	hints = {
		enabled = true,
	},
}

---@type avante.Config
M.options = {}

---@class avante.ConflictConfig: AvanteConflictConfig
---@field mappings AvanteConflictMappings
---@field highlights AvanteConflictHighlights
M.diff = {}

---@type Provider[]
M.providers = {}

---@param opts? avante.Config
function M.setup(opts)
	vim.validate({ opts = { opts, "table", true } })

	M.options = vim.tbl_deep_extend(
		"force",
		M.defaults,
		opts or {},
		---@type avante.Config
		{
			behaviour = {
				support_paste_from_clipboard = M.support_paste_image(),
			},
		}
	)
	M.providers = vim.iter(M.defaults)
		:filter(function(_, value)
			return type(value) == "table" and value.endpoint ~= nil
		end)
		:fold({}, function(acc, k)
			acc = vim.list_extend({}, acc)
			acc = vim.list_extend(acc, { k })
			return acc
		end)

	vim.validate({ provider = { M.options.provider, "string", false } })

	M.diff = vim.tbl_deep_extend(
		"force",
		{},
		M.options.diff,
		{ mappings = M.options.mappings.diff, highlights = M.options.highlights.diff }
	)

	if next(M.options.vendors) ~= nil then
		for k, v in pairs(M.options.vendors) do
			M.options.vendors[k] = type(v) == "function" and v() or v
		end
		vim.validate({ vendors = { M.options.vendors, "table", true } })
		M.providers = vim.list_extend(M.providers, vim.tbl_keys(M.options.vendors))
	end
end

---@param opts? avante.Config
function M.override(opts)
	vim.validate({ opts = { opts, "table", true } })

	M.options = vim.tbl_deep_extend("force", M.options, opts or {})
	M.diff = vim.tbl_deep_extend(
		"force",
		{},
		M.options.diff,
		{ mappings = M.options.mappings.diff, highlights = M.options.highlights.diff }
	)

	if next(M.options.vendors) ~= nil then
		for k, v in pairs(M.options.vendors) do
			M.options.vendors[k] = type(v) == "function" and v() or v
			if not vim.tbl_contains(M.providers, k) then
				M.providers = vim.list_extend(M.providers, { k })
			end
		end
		vim.validate({ vendors = { M.options.vendors, "table", true } })
	end
end

M = setmetatable(M, {
	__index = function(_, k)
		if M.options[k] then
			return M.options[k]
		end
	end,
})

M.support_paste_image = function()
	return Utils.has("img-clip.nvim") or Utils.has("img-clip")
end

M.get_window_width = function()
	return math.ceil(vim.o.columns * (M.windows.width / 100))
end

---@param provider Provider
---@return boolean
M.has_provider = function(provider)
	return M.options[provider] ~= nil or M.vendors[provider] ~= nil
end

---get supported providers
---@param provider Provider
---@return AvanteProviderFunctor
M.get_provider = function(provider)
	if M.options[provider] ~= nil then
		return vim.deepcopy(M.options[provider], true)
	elseif M.vendors[provider] ~= nil then
		return vim.deepcopy(M.vendors[provider], true)
	else
		error("Failed to find provider: " .. provider, 2)
	end
end

M.BASE_PROVIDER_KEYS = {
	"endpoint",
	"model",
	"deployment",
	"api_version",
	"proxy",
	"allow_insecure",
	"api_key_name",
	"timeout",
	-- internal
	"local",
	"_shellenv",
	"tokenizer_id",
	"use_xml_format",
}

return M

~/.config/nvim/init.lua中添加

1	require("plugin-config.avante").setup()

在~/.bashrc添加api key

1	export ANTHROPIC_API_KEY=*** #码了

Fix Error

Error #1

运行时出现这个报错

参见 https://github.com/yetone/avante.nvim/pull/560/files

由于这个PR没有被merge所以决定fork一下这个仓库再应用更改

push到远程仓库后记得release一下

然后将该plugin的仓库地址改为我自己的

成功消除了这个报错 > 现已切换回原仓库的main branch

Error #2

没有回复访问 https://console.anthropic.com/settings/plans 发现是没有Funds（说好的注册送5刀的额度的呢。。。）

使用WildCard服务 https://bewildcard.com/card ，具体账密见密码簿

Error #3

发现服务器无响应

设置终端proxy export https_proxy=127.0.0.1:7897
尝试输入

curl https://api.anthropic.com/v1/messages \                                                                                                   
  -H "x-api-key: <api-key>" \
  -H "anthropic-version: 2023-06-01" \
  -H "content-type: application/json" \
  -d '{
    "model": "claude-3-sonnet-20240229",
    "max_tokens": 1024,
    "messages": [
      {
        "role": "user",
        "content": "What is the capital of France?"
      }
    ]
  }'

sudo conda update curl to solve curl(77) error
export CURL_CA_BUNDLE=/etc/ssl/certs/ca-certificates.crt

Error #4

更新编译报错

ISSUE: https://github.com/yetone/avante.nvim/issues/612#issuecomment-2375729928

cd ~/.local/share/nvim/lazy/avante.nvim
make # or BUILD_FROM_SOURCE=true make
ls ./build/
avante_html2md.so  avante_repo_map.so  avante_templates.so  avante_tokenizers.so

Posted 2024-09-26Updated 2025-08-15Note3 minutes read (About 382 words)

论文工作流

参考: https://forum-zh.obsidian.md/t/topic/292

文献搜集网站

1

Google Scholar 114

2

Connected Papers

总结了各文章之间的关联度，太直观了

3

Citexs Paperpicky
领域文献调研分析网络图将推荐文献可视化，在这里可以很容易看到文献之间的引用和被引用的关系。每个圆球代表一篇文章，圆球大小与共引次数正相关，从AI推荐文献、经典文献、核心文献三个维度给大家推荐出关联文献。
（1）AI推荐文献是根据所输入的关键词，智能筛选出与关键词高度关联的文献，在这里大家可以看到近期发表的研究成果；
（2）经典文献是基于推荐文献共引用关系筛选出的文献，一般都是该领域的必读文献，也就是说只要你研究该领域，这些文献肯定要读；
（3）核心文献则是AI推荐文献和经典文献重叠共引的部分。

文献管理软件

Zotero

参考: https://sspai.com/post/56724 安装了`Firefox-Zotero`插件，可以直接捕获网页内容并存到Zotero Liberary中（打开pdf界面后直接点击插件按钮即可保存）保存后Zotero会自行抓取论文关键信息并重新生成标题

Posted 2024-09-26Updated 2025-08-15Notea minute read (About 200 words)

ESP32-CAM Video Stream

参考仓库: https://github.com/rzeldent/esp32cam-rtsp

Objective

Stream video through wifi using ESP-32-CAM.
The video sources can be accessed by an ip address.

Implemenation

Install PlatformIO plugin in vscode.
Clone the repository

1	git clone --recursive https://github.com/rzeldent/esp32cam-rtsp.git

Use vscode pio to open the project. Wait pio till its configuration is done.

Change the default_envs settings, here I use esp32cam_ai_thinker

If no default_envs is specified, pio will build the project for all platforms

Here we can build and upload the program to the board.

Connect to ESP**** WiFi, and visit http://192.168.4.1 to configure the wifi settings of the board. I choose to my phone’s hot spot.

Then open the monitor to check the ip address (you need to connect your computer to the same LAN (local area network) to visit the ip)

Visit the 192.168.23.142 and you will see the page (similar with the page of http://192.168.4.1):

Click rtsp://192.168.23.197:554/mjpeg/1 and you will see the streaming video:

生无可恋做横向.jpg

Posted 2024-09-23Updated 2025-08-15Notea minute read (About 223 words)

ESP32 Development on Linux

Standard Toolchain Setup

Guide: https://docs.espressif.com/projects/esp-idf/en/stable/esp32/get-started/linux-macos-setup.html#

ESP_IDF

I directly download the archive with all the submodules included: https://github.com/espressif/esp-idf/releases/tag/v5.3.1

This archive can also be downloaded from Espressif’s download server: https://dl.espressif.com/github_assets/espressif/esp-idf/releases/download/v5.3.1/esp-idf-v5.3.1.zip

cd into the unzip folder (The installation will fail in conda venv)

1
2
3

conda deactivate
./install.sh
. ./export.sh

In ~/.zshrc

1	alias get_idf='. $HOME/esp/esp-idf-v5.3.1/export.sh'

Then each time I need to setup esp32 development environment, I only need to type get_idf.

Some useful commands

idf.py set-target esp32
idf.py menuconfig
idf.py build
idf.py -p /dev/ttyUSB0 -b 115200 flash
idf.py -p /dev/ttyUSB0 -b monitor
idf.py -p /dev/ttyUSB0 flash monitor # combine together

PlatformIO

事实证明pio是最方便的。。
事前安装过pio的vscode插件，直接打开pio的esp32项目就直接可以编译上传以及查看串口监视器。

Posted 2024-09-20Updated 2025-08-15Notea few seconds read (About 65 words)

Shader Learning Materials

一款线上还在更新中的shader教学，有可交互代码，爆赞
https://thebookofshaders.com/

一些dalao写的shader：
https://www.shadertoy.com/

Shader Tutorials by Ronja
https://www.ronja-tutorials.com/
https://github.com/ronja-tutorials/ShaderTutorials?tab=readme-ov-file

Shader Tutorial For Beginner
https://github.com/Xibanya/ShaderTutorials

Posted 2024-09-16Updated 2025-08-15Notea few seconds read (About 90 words)

live2d 复活计划

因为oh-my-live2d官网崩了，所以需要自己私有托管一下live2d服务才能让博客网上的小埋复活。(bonus:可以借此尝试自己制作一个live2d形象)
更于9/25，拖着很久没搞，然后官网自己复活了，好好好，直接快进到customized live2d。

Posted 2024-09-03Updated 2025-08-15Note4 minutes read (About 566 words)

Hexo Comment System -- Twikoo

因为之前使用gitalk评论系统在国内用不了了，所以尝试更换为一些国内可用的方案。其中一个是国内开发者开发@imaegoo搭建的Twikoo系统

> 拜一下，我的博客网也是改自他的魔改版hexo-icarus主题

对，然后针对icarus系的主题，官网有提供详细例程
主要分为云函数部署部分和前端配置部分

云函数

归根结底就是为静态的博客网提供一个可以响应用户数据提交和显示的服务。
主要分为两块

Mongodb: 用于云存储用户评论数据
Huggingface: 用于部署一个云服务器，响应用户需求，与db交互。

那么首先就需要去mongodb申请一个数据库，依照这个例程： https://twikoo.js.org/mongodb-atlas.html

> 密码保密)

随后huggingface的服务就通过这串链接字符串来与这个数据库交互

随后去huggingface开启云服务器: https://twikoo.js.org/backend.html#hugging-face-%E9%83%A8%E7%BD%B2
我的云仓库为： https://huggingface.co/spaces/CallMeChen/BlogComment
启动服务成功后可以看到如下log:

前端配置

参照 https://www.anzifan.com/post/icarus_to_candy_2

关于评论头像

访客还可以通过输入数字 QQ 邮箱地址，使用 QQ 头像发表评论。

后续

添加图床方便上传图片(已解决)

维护

10/20发现评论系统不能用了，发现是mongodb暂停了数据库服务，想起来它应该是每个月都需要自己connect一下（也算是释放不活跃用户的资源吧）。

每周维护的时候需要进到 https://cloud.mongodb.com/v2/6610ebcc85d7126b38c3837b#/overview connect一下

随后发现huggingface的仓库一直卡在build，遂更换为Netlify云函数平台。
https://app.netlify.com/sites/admirable-sunburst-4762e3/configuration/general

这个仓库的函数会把评论存贮在test/comment下，和之前的twikoo/comment不同，导致历史评论无法显示，所以在twikoo/comment下使用aggregation **$merge**

1
2
3

{
     into: { db: "test", coll: "comment" },
}

Posted 2024-08-05Updated 2025-08-15Note12 minutes read (About 1785 words)

ECE3730J Final RC Part 1

L2 (Embedded system overview)

Take a look at the differences between Microprocessor (in our personal computer or phone) and Microcontroller

### Von Neumann vs Harvard architecture

efficiency: Harvard architecture can avoid the "Von Neumann bottleneck" > VERBOSE~ **Von Neumann bottleneck**: when the bandwidth between CPU and RAM is much lower than the speed at which a typical CPU can process data, because the shared bus for instructions and data can cause competition.

In embedded system, harvard architecture is widely used.
Our board (STM32F103C8T6) use harvard architecture on physical level (refer to the block diagram in reference manual)

>*VERBOSE~* **ICode bus**:This bus connects the Instruction bus of the Cortex®-M3 core to the Flash memory instruction interface. **DCode bus**:This bus connects the DCode bus (literal load and debug access) of the Cortex ®-M3 core to the Flash memory Data interface.

However, in the software level, we treat the instruction memory and data memory as a whole block of memory (therefore, it is more accurate to say that the stm32 uses a mixed Harvard and von Neumann architecture.).

In stm32, instruction memory, data memory, registers of peripherals/IO are all mapped to memory.

Table from https://embeddedsecurity.io/vendor-stm32

L3 (Programming)

Type Qualifiers

`const`

implies that value not supposed to be written by program (read only) during run-time. Can be modified by others like hardware.

If you want to save the limited RAM spaces (data memory) for other variables, you can use this keyword to store this variable in ROM (program memory). It’s important for harvard architecture.

`volatile`

indicate the value can be changed by something other than program so it should be reexamined frequently.
This means two things:
- The compiler will not try to optimize the variable with volatile. See the two examples on slides.
- Each time the program reads the volatile variable, the processor will not look into cached data memory, meaning that the program can always get the newest updated data in memory (which is very important when external hardware change the variable). ~~However, this case is not relevant with STM32 MCU since it didn’t have cache.~~

What about `const volatile` ?

The combination of the above two concepts. Usually used to declare pointers
Example:
const volatile char *a declares a pointer pointing to a value that cannot be changed by the program through *a, but the value of a can be changed (pointing to another value). *a = 0 is not allowed, a = &b is allowed.

Generally, we use const volatile to declare pointers that points to hardware registers or memory-mapped Input ports(read only).

Basic Program Structure

// import header file for the board (containing the declaration of SFR)
// declare global variables
int main(void){
	// initialization (system clock, peripherals)
	while(true) // super loop, the the program alive
	{
		// interact with peripherals
	}

}

How to interact with peripherals

We need to use C code to set the value of SFR.

SFR (special function registers)

These are registers that are embedded in peripherals, used for configuration and control of peripherals.

VERBOSE~
If we want to get the status of a peripheral, we read the value of SFR.
If we want to send something to peripheral, we write value to SFR.

Let’s take timer as an example:
SFR in block diagram of timer:

SFR declaration in code:

We change operate with these registers through Bit Operation.

(These are PIC32 codes but I think you have got it)

L4 (IO)

All the modes of GPIO:

The whole block diagram:

You may see an unfamiliar unit

Here.
It’s used to convert the input analog voltage to digital voltage.
You can memorize it through this graph:
Transfer function of a Schmitt trigger.

>The horizontal and vertical axes are input voltage and output voltage, respectively. T and −T are the switching thresholds, and M and −M are the output voltage levels.

Input

After we configured the GPIO to be input ports, the output driver is disabled (disconnected).

Pull down

- When the IO pin is connected to LOW (0V) or unconnected, the input data register will be 0. - When the IO pin is connected to HIGH (3.3V/5V), the input data register will be 1.

Pull up

- When the IO pin is connected to LOW (0V), the input data register will be 0. - When the IO pin is connected to HIGH (3.3V/5V) or unconnected, the input data register will be 1.

Floating

- When the IO pin is connected to LOW (0V), the input data register will be 0. - When the IO pin is connected to HIGH (VDD), the input data register will be 1. - When the IO pin is unconnected, the input data register will be unpredictable.

General Purpose Output

The input driver part is still enabled so that we can read the output status.

Open Drain

Can “generate” voltage higher than VDD at IO pin.

- “0” in the Output register activates the N-MOS (LOW (0V) at IO pin) - “1” in the Output register leaves the port in Hi-Z (HIGH (V+) at IO pin) (the P-MOS is never activated)

Push Pull

Most common one.

- “0” in the Output register activates the N-MOS (LOW (0V) at IO pin) - “1” in the Output register activates the P-MOS (HIGH (VDD) at IO pin)

Alternative Function Output

Not covered.

L5 (Interrupts)

Why interrupt?

Most of the peripherals take quite a few time to complete its task or trigger an event. Instead of instruct the processor to keep checking the status of these peripheral (polling), we want the peripherals to inform processor when there exists an event, so that the processor can focus on its own task.

Peripherals inform the processor through external interrupt.

Where do interrupts come from

We mainly deal with the interrupts from peripherals.

Each peripheral can have multiple interrupt sources, indicating different events.

How to handle interrupts

Through interrupt service routine (ISR)

#### Interrupt vectors Interrupt vectors are **addresses that inform the interrupt handler as to where to find the ISR**

| Vector Number | Interrupt Number | Description                    | Vector Address   |
|---------------|------------------|--------------------------------|------------------|
| 0             | -                | Initial Stack Pointer          | 0x0800 0000      |
| 1             | -                | Reset Handler                  | 0x0800 0004      |
....
| 27            | 11               | DMA1 Channel1 global Interrupt | 0x0800 006C      |
| 28            | 12               | DMA1 Channel2 global Interrupt | 0x0800 0070      |
| 29            | 13               | DMA1 Channel3 global Interrupt | 0x0800 0074      |
| 30            | 14               | DMA1 Channel4 global Interrupt | 0x0800 0078      |
| 31            | 15               | DMA1 Channel5 global Interrupt | 0x0800 007C      |
| 32            | 16               | DMA1 Channel6 global Interrupt | 0x0800 0080      |
| 33            | 17               | DMA1 Channel7 global Interrupt | 0x0800 0084      |
...
| 41            | 25               | TIM1 Break Interrupt           | 0x0800 00A4      |
| 42            | 26               | TIM1 Update Interrupt          | 0x0800 00A8      |
| 43            | 27               | TIM1 Trigger and Commutation   | 0x0800 00AC      |
| 44            | 28               | TIM1 Capture Compare Interrupt | 0x0800 00B0      |
...

This is an arbitrary IVT that depicts the pattern of IVT, for detailed IVT of STM32, please refer to the reference manual.

What to do inside ISR

Always remember to clear the interrupt flag in ISR.
Because there is often more interrupt sources than interrupt vectors, you need to judge which source triggered this interrupt based on interrupt status register.
Customized operation… (but be short, because if the operation take too many clock cycles, it may be interrupted by another interrupt source, which may not be on purpose)

Nested interrupts

When executing ISR, the processor can be interrupted by interrupts with higher priority. > **Remainder:** the lower the priority number, the higher the priority

L6 (Timer)

Let’s split Timer peripheral into 3 parts:

#### Blue part: Master/slave controller The master/slave unit provides the time-base unit with the **counting clock signal** (for example the CK_PSC signal, PSC here means that it's for the prescaler in time-base unit), as well as the counting direction (counting up/down) control signal. This unit mainly provides the **control signals** for the time-base unit. #### Yellow part: Time-base unit The main block of the programmable timer is a **16-bit** counter with its related auto-reload register. The counter can count up, down or both up and down. The counter clock can be divided by a **prescaler** (which is basically another counter).

> On the reference manual there are many wave-forms for you to understand how these control registers take effects.

The reset frequency of the counter is $$\frac{f_{input}}{(Prescaler+1)\times(Counter Period+1)}$$

Red part: Timer-channels unit

The timer channels are the working elements of the timer.
They are the means by which a timer peripheral interacts with its external environment (through input capture or output compare).

How to use multiple timer for more counting digits (32-bit)

It is possible to configure one slave timer to increment its counter based on a master-timer events such as the timer update event. The master-timer event is signaled by the master timer master/slave controller unit. This controlling unit uses the master timer output-TRGO signal. The master timer output-TRGO signal is connected to the slave timer TRGI-input signal. The master/slave controller unit of the slave timer is configured to use the TRGI-input signal as clock source to increment the slave timer counter.

L7 (LCD)

Just refer to RC2_LCD. I think it’s not the focus of final exam.

L9 & L10 (Input Capture & Output Compare)

IC is a peripheral that can monitor the input signal changes (pos/neg edge) independent of the processor (Core).
OC is a peripheral that can generate precise output signal independent of the processor (Core).

In STM32, it is embedded in timer peripheral (together with output compare).

IC

You can consider IC as a timer value recorder. It will record the timer value each time the capture condition is met (you can see from the diagram, the timer value is from CNT counter).

These conditions can be:

rising edge

falling edge

both
With a prescaler, we trigger capture events every few edges.
Similar for interrupt, we can trigger an interrupt every few captures.

Note:
The IC does not capture the edge immediately when a rising or falling edge happened. The capture event needs to be sync with PB_clk.
Further more, the module will capture the timer counter value that is valid 2-3 PB_clk cycles after the capture event.

For detailed configuration, please refer to Reference Manual.pdf on canvas, page 349-359, 382-385.

OC

Just refer to RC2_Output_Campare and RC3_Lab4 for the concepts and PWM configuration.
Also, the solution of hw2 has been uploaded to canvas, please take a look.