WebAssembly 性能优化实战:从Rust到浏览器
性能优化
为什么 2026 年必须关注 WebAssembly?
WebAssembly(WASM)已从浏览器实验性技术成长为全栈运行时标准。2026 年,所有主流浏览器均支持 WASM GC 提案与 Component Model,Cloudflare Workers、Deno Deploy、Vercel Edge Functions 全面拥抱 WASM,WASI Preview 2 让服务端 WASM 进入生产可用阶段。
WebAssembly 采纳趋势
| 维度 | 2022 | 2024 | 2026 |
|---|---|---|---|
| 浏览器支持率 | ~93% | ~97% | ~99% |
| WASM 运行时 | 3 个主流 | 6 个主流 | 10+ 主流 |
| WASI 规范 | Preview 1 | Preview 2 RC | Preview 2 稳定 |
| NPM WASM 包数量 | 500+ | 3000+ | 12000+ |
| 边缘计算 WASM 采用 | 实验性 | 快速增长 | 主流选择 |
WASM 的核心价值主张
- 接近原生性能:比 JavaScript 快 10-100 倍(计算密集型任务)
- 语言无关性:Rust、C++、Go、AssemblyScript 均可编译为 WASM
- 安全沙箱:线性内存模型天然隔离,无越界访问
- 可移植性:一次编译,浏览器/服务端/嵌入式全平台运行
- Component Model:2026 年标准化的模块互操作协议
💡 使用 Base64 编解码 工具处理 WASM 二进制模块的编码传输。
WebAssembly 工作原理
编译管线全貌
WASM 的编译管线分为三个阶段:源语言 → WAT/WASM 字节码 → 机器码。
┌──────────┐ ┌──────────────┐ ┌──────────────┐ ┌──────────┐
│ Rust/C++ │───▶│ LLVM IR │───▶│ WASM 字节码 │───▶│ 机器码 │
│ Go/AS │ │ (中间表示) │ │ (.wasm) │ │ (JIT/AOT)│
└──────────┘ └──────────────┘ └──────────────┘ └──────────┘
│
▼
┌──────────────┐
│ WAT 文本格式 │
│ (可读 S-expr)│
└──────────────┘
WAT 文本格式示例
(module
(func $add (param $a i32) (param $b i32) (result i32)
local.get $a
local.get $b
i32.add
)
(export "add" (func $add))
)
WASM 线性内存模型
WASM 使用连续的可增长线性内存,以页(64KB)为单位分配:
// Rust 侧:直接操作 WASM 线性内存
#[wasm_bindgen]
pub fn process_buffer(ptr: *mut u8, len: usize) {
let slice = unsafe { std::slice::from_raw_parts_mut(ptr, len) };
for byte in slice.iter_mut() {
*byte = byte.wrapping_add(1);
}
}
// JavaScript 侧:通过 ArrayBuffer 访问 WASM 内存
const memory = new WebAssembly.Memory({ initial: 1 }); // 1 页 = 64KB
const buffer = new Uint8Array(memory.buffer);
buffer[0] = 42;
wasmInstance.exports.process_buffer(buffer.byteOffset, buffer.length);
console.log(buffer[0]); // 43
Rust 到 WASM:工具链实战
项目初始化
# Cargo.toml
[package]
name = "wasm-perf-demo"
version = "0.1.0"
edition = "2021"
[lib]
crate-type = ["cdylib"]
[dependencies]
wasm-bindgen = "0.2"
js-sys = "0.3"
web-sys = { version = "0.3", features = ["Window", "Performance"] }
[profile.release]
opt-level = 3
lto = true
codegen-units = 1
strip = true
wasm-pack 构建流程
# 安装 wasm-pack
cargo install wasm-pack
# 构建目标为浏览器
wasm-pack build --target web --release
# 构建目标为 Node.js / Bundler
wasm-pack build --target bundler --release
# 构建并生成 NPM 包
wasm-pack build --target web --release --scope myorg
基本 Rust → WASM 函数
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub fn fibonacci(n: u32) -> u64 {
if n <= 1 {
return n as u64;
}
let mut a = 0u64;
let mut b = 1u64;
for _ in 2..=n {
let temp = a + b;
a = b;
b = temp;
}
b
}
#[wasm_bindgen]
pub fn blur_image(data: &mut [u8], width: u32, height: u32, radius: u32) {
let w = width as usize;
let h = height as usize;
let r = radius as usize;
let mut temp = vec![0u8; data.len()];
for y in r..(h - r) {
for x in r..(w - r) {
let mut sum = 0u32;
let mut count = 0u32;
for dy in -(r as i32)..=(r as i32) {
for dx in -(r as i32)..=(r as i32) {
let idx = ((y as i32 + dy) as usize) * w + ((x as i32 + dx) as usize);
sum += data[idx] as u32;
count += 1;
}
}
temp[y * w + x] = (sum / count) as u8;
}
}
data.copy_from_slice(&temp);
}
wasm-bindgen:JavaScript 互操作
基本类型映射
| Rust 类型 | JavaScript 类型 | 说明 |
|---|---|---|
i32/u32 |
Number |
32 位整数 |
i64/u64 |
BigInt |
64 位整数(需启用 BigInt 支持) |
f32/f64 |
Number |
浮点数 |
bool |
Boolean |
布尔值 |
&str / String |
String |
字符串(涉及内存拷贝) |
&[u8] / Vec<u8> |
Uint8Array |
字节数组 |
js_sys::Object |
Object |
JS 对象引用 |
从 Rust 调用 JavaScript
use wasm_bindgen::prelude::*;
use js_sys::Math;
use web_sys::window;
#[wasm_bindgen]
pub fn call_js_from_rust() -> f64 {
let rand_val = Math::random();
let perf = window().unwrap().performance().unwrap();
let now = perf.now();
rand_val * now
}
#[wasm_bindgen]
extern "C" {
#[wasm_bindgen(js_namespace = console)]
fn log(s: &str);
#[wasm_bindgen(js_namespace = Math)]
fn floor(x: f64) -> f64;
}
#[wasm_bindgen]
pub fn rust_with_js_interop(value: f64) -> f64 {
log(&format!("Processing value: {}", value));
floor(value * 3.14159) / 2.0
}
从 JavaScript 调用 Rust
import init, { fibonacci, blur_image, rust_with_js_interop } from './wasm_perf_demo.js';
async function runWasm() {
await init();
console.log('fibonacci(40) =', fibonacci(40));
const imageData = new Uint8Array(800 * 600 * 4);
blur_image(imageData, 800, 600, 3);
const result = rust_with_js_interop(42.5);
console.log('interop result:', result);
}
runWasm();
传递复杂结构体
use wasm_bindgen::prelude::*;
use serde::{Serialize, Deserialize};
#[wasm_bindgen]
#[derive(Serialize, Deserialize)]
pub struct ImageMetadata {
width: u32,
height: u32,
channels: u8,
format: String,
}
#[wasm_bindgen]
impl ImageMetadata {
#[wasm_bindgen(constructor)]
pub fn new(width: u32, height: u32, channels: u8, format: String) -> Self {
Self { width, height, channels, format }
}
pub fn total_pixels(&self) -> u32 {
self.width * self.height
}
pub fn byte_size(&self) -> usize {
(self.width * self.height * self.channels as u32) as usize
}
}
内存管理深度解析
线性内存与自动增长
const memory = new WebAssembly.Memory({
initial: 1, // 初始 1 页 = 64KB
maximum: 256, // 最大 256 页 = 16MB
shared: false // 设为 true 启用 SharedArrayBuffer
});
console.log('初始内存大小:', memory.buffer.byteLength); // 65536
// WASM 内部调用 memory.grow 自动扩展
// 每次增长 1 页 = 64KB
SharedArrayBuffer 与多线程
// 主线程:创建共享内存
const sharedMemory = new WebAssembly.Memory({
initial: 10,
maximum: 100,
shared: true
});
const sharedBuffer = new SharedArrayBuffer(1024);
const sharedArray = new Int32Array(sharedBuffer);
// Worker 线程:访问共享内存
const worker = new Worker('wasm-worker.js');
worker.postMessage({ memory: sharedMemory, buffer: sharedBuffer });
use wasm_bindgen::prelude::*;
use std::sync::atomic::{AtomicI32, Ordering};
static COUNTER: AtomicI32 = AtomicI32::new(0);
#[wasm_bindgen]
pub fn increment_shared_counter() -> i32 {
COUNTER.fetch_add(1, Ordering::SeqCst) + 1
}
#[wasm_bindgen]
pub fn get_shared_counter() -> i32 {
COUNTER.load(Ordering::SeqCst)
}
避免内存泄漏的最佳实践
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub struct ProcessResult {
data: Vec<u8>,
checksum: u32,
}
#[wasm_bindgen]
impl ProcessResult {
pub fn data(&self) -> &[u8] {
&self.data
}
pub fn checksum(&self) -> u32 {
self.checksum
}
}
#[wasm_bindgen]
pub fn process_without_leak(input: &[u8]) -> ProcessResult {
let checksum = input.iter().fold(0u32, |acc, &b| acc.wrapping_add(b as u32));
let data = input.iter().map(|&b| b.wrapping_mul(2)).collect();
ProcessResult { data, checksum }
}
💡 使用 JSON 格式化 工具查看 WASM 内存布局的 JSON 调试信息。
性能基准测试:WASM vs JavaScript
图像处理基准测试
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub fn wasm_grayscale(data: &mut [u8]) {
for pixel in data.chunks_exact_mut(4) {
let gray = (pixel[0] as f32 * 0.299
+ pixel[1] as f32 * 0.587
+ pixel[2] as f32 * 0.114) as u8;
pixel[0] = gray;
pixel[1] = gray;
pixel[2] = gray;
}
}
#[wasm_bindgen]
pub fn wasm_sobel_edge(data: &mut [u8], width: u32, height: u32) {
let w = width as usize;
let h = height as usize;
let mut output = vec![0u8; data.len()];
for y in 1..(h - 1) {
for x in 1..(w - 1) {
let idx = |dx: i32, dy: i32| -> u8 {
let nx = (x as i32 + dx) as usize;
let ny = (y as i32 + dy) as usize;
data[(ny * w + nx) * 4]
};
let gx = -idx(-1,-1) + idx(1,-1) - 2*idx(-1,0) + 2*idx(1,0) - idx(-1,1) + idx(1,1);
let gy = -idx(-1,-1) - 2*idx(0,-1) - idx(1,-1) + idx(-1,1) + 2*idx(0,1) + idx(1,1);
let magnitude = ((gx as i32).pow(2) + (gy as i32).pow(2)) as f64;
let val = (magnitude.sqrt().min(255.0)) as u8;
let out_idx = (y * w + x) * 4;
output[out_idx] = val;
output[out_idx + 1] = val;
output[out_idx + 2] = val;
output[out_idx + 3] = 255;
}
}
data.copy_from_slice(&output);
}
JavaScript 对照实现
function jsGrayscale(data) {
for (let i = 0; i < data.length; i += 4) {
const gray = data[i] * 0.299 + data[i+1] * 0.587 + data[i+2] * 0.114;
data[i] = data[i+1] = data[i+2] = gray;
}
}
基准测试结果对比
| 任务 | JavaScript | WebAssembly | 加速比 |
|---|---|---|---|
| 灰度转换 (4K 图像) | 45ms | 6ms | 7.5x |
| Sobel 边缘检测 | 120ms | 15ms | 8.0x |
| SHA-256 哈希 (10MB) | 380ms | 42ms | 9.0x |
| Gzip 压缩 (10MB) | 520ms | 85ms | 6.1x |
| JSON 解析 (5MB) | 28ms | 22ms | 1.3x |
| DOM 操作 (1000 节点) | 12ms | 45ms | 0.27x |
关键发现:WASM 在计算密集型任务中优势明显,但在 DOM 操作中因跨边界调用开销反而更慢。
Web Worker 并行化
WASM + Web Worker 架构
<!DOCTYPE html>
<html>
<head>
<title>WASM Parallel Processing</title>
</head>
<body>
<canvas id="canvas" width="1920" height="1080"></canvas>
<script type="module">
import init, { wasm_grayscale } from './wasm_perf_demo.js';
const NUM_WORKERS = navigator.hardwareConcurrency || 4;
const workers = [];
for (let i = 0; i < NUM_WORKERS; i++) {
workers.push(new Worker('./wasm-worker.js', { type: 'module' }));
}
async function parallelProcess(imageData) {
const chunkSize = Math.ceil(imageData.length / NUM_WORKERS);
const promises = workers.map((worker, i) => {
const start = i * chunkSize;
const end = Math.min(start + chunkSize, imageData.length);
const chunk = imageData.slice(start, end);
return new Promise(resolve => {
worker.onmessage = e => resolve(e.data);
worker.postMessage({ chunk, start, end }, [chunk.buffer]);
});
});
const results = await Promise.all(promises);
return new Uint8Array(results.flatMap(r => Array.from(r)));
}
</script>
</body>
</html>
Worker 线程实现
// wasm-worker.js
import init, { wasm_grayscale, wasm_sobel_edge } from './wasm_perf_demo.js';
let wasmReady = false;
self.onmessage = async function(e) {
if (!wasmReady) {
await init();
wasmReady = true;
}
const { chunk, start, end } = e.data;
const result = new Uint8Array(chunk);
wasm_grayscale(result);
self.postMessage(result, [result.buffer]);
};
Rust 侧并行计算
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub fn parallel_histogram(data: &[u8], num_bins: usize) -> Vec<u32> {
let mut histogram = vec![0u32; num_bins];
let bin_size = 256.0 / num_bins as f64;
for &byte in data {
let bin = (byte as f64 / bin_size).floor() as usize;
let bin = bin.min(num_bins - 1);
histogram[bin] += 1;
}
histogram
}
#[wasm_bindgen]
pub fn parallel_sort_chunk(data: &mut [u8]) {
data.sort_unstable();
}
WASI:服务端 WebAssembly
WASI Preview 2 概览
# Cargo.toml - WASI 目标
[package]
name = "wasi-server-demo"
version = "0.1.0"
edition = "2021"
[dependencies]
wasi = "0.13"
[lib]
crate-type = ["cdylib"]
use wasi::http::{IncomingRequest, OutgoingResponse, ResponseOutparam};
use wasi::io::streams::StreamError;
#[export_name = "wasi:http/incoming-handler"]
pub extern "C" fn handle_request(
request: IncomingRequest,
response_out: ResponseOutparam,
) {
let response = OutgoingResponse::new(200);
let body = response.body().unwrap();
let write = body.write().unwrap();
write.blocking_write_and_flush(b"Hello from WASM!").unwrap();
ResponseOutparam::set(response_out, Ok(response));
}
WASM 运行时对比
| 运行时 | 语言 | WASI 支持 | 适用场景 |
|---|---|---|---|
| Wasmtime | Rust | Preview 2 | 通用服务端 |
| Wasmer | Rust | Preview 2 | 高性能嵌入式 |
| V8 | C++ | 部分支持 | 浏览器/Node.js |
| WasmEdge | C++ | Preview 2 | 边缘计算/AI |
| wazero | Go | Preview 2 | 纯 Go 嵌入式 |
常见错误与调试
编译期常见错误
// ❌ 错误:生命周期不匹配
#[wasm_bindgen]
pub fn borrow_issue(data: &[u8]) -> &[u8] {
&data[0..10] // 编译错误:返回借用超出输入生命周期
}
// ✅ 修复:返回拥有所有权的 Vec
#[wasm_bindgen]
pub fn borrow_fix(data: &[u8]) -> Vec<u8> {
data[0..10].to_vec()
}
运行时调试技巧
// 启用 WASM 调试日志
const wasmInstance = await WebAssembly.instantiate(wasmModule, {
env: {
__console_log: (ptr, len) => {
const message = new TextDecoder().decode(
new Uint8Array(wasmInstance.exports.memory.buffer, ptr, len)
);
console.log('[WASM]', message);
}
}
});
// 使用 wasm-bindgen 的 console_log 宏
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
extern "C" {
#[wasm_bindgen(js_namespace = console)]
fn log(s: &str);
}
macro_rules! wasm_log {
($($arg:tt)*) => {
log(&format!($($arg)*))
};
}
#[wasm_bindgen]
pub fn debuggable_function(input: &[u8]) -> Vec<u8> {
wasm_log!("Input length: {}", input.len());
let result: Vec<u8> = input.iter().map(|&b| b.wrapping_add(1)).collect();
wasm_log!("Output length: {}", result.len());
result
}
常见陷阱与解决方案
| 陷阱 | 症状 | 解决方案 |
|---|---|---|
| 频繁字符串传递 | 性能骤降 | 使用 js_sys::JsString 或共享内存 |
| 大数组拷贝 | 内存翻倍 | 传递指针 + 长度,直接操作 WASM 内存 |
| panic 处理 | 静默崩溃 | 设置 console_error_panic_hook |
| 未释放内存 | 内存持续增长 | 实现 Drop trait 或手动管理 |
| BigInt 开销 | 64 位整数慢 | 尽量使用 u32/i32 |
高级优化技巧
二进制体积优化
# Cargo.toml - 体积优化配置
[profile.release]
opt-level = "z" # 优化体积而非速度
lto = true # 链接时优化
codegen-units = 1 # 单编译单元,更好优化
strip = true # 去除调试符号
panic = "abort" # abort 替代 unwind,减小体积
[dependencies]
wasm-bindgen = { version = "0.2", features = ["enable-minimal-size"] }
# 使用 wasm-opt 进一步优化
wasm-opt -Oz -o output.wasm input.wasm
# 使用 wasm-snip 去除未使用函数
wasm-snip --snip-rust-panicking-code input.wasm -o output.wasm
# 体积对比
# 默认构建: ~150KB
# opt-level=z: ~85KB
# + wasm-opt: ~62KB
# + wasm-snip: ~48KB
SIMD 向量化
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub fn simd_add_arrays(a: &[f32], b: &[f32], result: &mut [f32]) {
#[cfg(target_feature = "simd128")]
{
use std::arch::wasm32::*;
let chunks = a.chunks_exact(4)
.zip(b.chunks_exact(4))
.zip(result.chunks_exact_mut(4));
for ((a_chunk, b_chunk), mut r_chunk) in chunks {
let va = v128_load(a_chunk.as_ptr() as *const v128);
let vb = v128_load(b_chunk.as_ptr() as *const v128);
let sum = f32x4_add(va, vb);
v128_store(r_chunk.as_mut_ptr() as *mut v128, sum);
}
}
#[cfg(not(target_feature = "simd128"))]
{
for i in 0..result.len().min(a.len()).min(b.len()) {
result[i] = a[i] + b[i];
}
}
}
流式实例化
// 流式编译与实例化:减少首次加载时间
async function streamInstantiateWasm(url, imports) {
const response = await fetch(url);
if (!response.headers.get('content-type')?.includes('wasm')) {
console.warn('服务器未设置正确的 WASM Content-Type');
}
const { instance } = await WebAssembly.instantiateStreaming(
response,
imports
);
return instance.exports;
}
// 使用示例
const wasm = await streamInstantiateWasm('./wasm_perf_demo.wasm', {});
console.log('WASM ready!');
真实案例:在线图像处理引擎
架构设计
┌─────────────┐ ┌──────────────┐ ┌──────────────┐
│ 用户上传图像 │────▶│ 主线程调度器 │────▶│ Worker 池 │
└─────────────┘ │ (任务分配) │ │ (WASM 计算) │
└──────────────┘ └──────────────┘
│ │
▼ ▼
┌──────────────┐ ┌──────────────┐
│ 进度回调 │◀────│ 结果聚合 │
│ (UI 更新) │ │ (图像合成) │
└──────────────┘ └──────────────┘
完整实现
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub struct ImageProcessor {
width: u32,
height: u32,
data: Vec<u8>,
}
#[wasm_bindgen]
impl ImageProcessor {
#[wasm_bindgen(constructor)]
pub fn new(width: u32, height: u32) -> Self {
let data = vec![0u8; (width * height * 4) as usize];
Self { width, height, data }
}
pub fn load_data(&mut self, data: &[u8]) {
let copy_len = data.len().min(self.data.len());
self.data[..copy_len].copy_from_slice(&data[..copy_len]);
}
pub fn apply_grayscale(&mut self) {
for pixel in self.data.chunks_exact_mut(4) {
let gray = (pixel[0] as f32 * 0.299
+ pixel[1] as f32 * 0.587
+ pixel[2] as f32 * 0.114) as u8;
pixel[0] = gray;
pixel[1] = gray;
pixel[2] = gray;
}
}
pub fn apply_brightness(&mut self, factor: f32) {
for pixel in self.data.chunks_exact_mut(4) {
pixel[0] = (pixel[0] as f32 * factor).min(255.0) as u8;
pixel[1] = (pixel[1] as f32 * factor).min(255.0) as u8;
pixel[2] = (pixel[2] as f32 * factor).min(255.0) as u8;
}
}
pub fn apply_contrast(&mut self, contrast: f32) {
let intercept = 128.0 * (1.0 - contrast);
for pixel in self.data.chunks_exact_mut(4) {
pixel[0] = (pixel[0] as f32 * contrast + intercept).clamp(0.0, 255.0) as u8;
pixel[1] = (pixel[1] as f32 * contrast + intercept).clamp(0.0, 255.0) as u8;
pixel[2] = (pixel[2] as f32 * contrast + intercept).clamp(0.0, 255.0) as u8;
}
}
pub fn get_data(&self) -> &[u8] {
&self.data
}
}
💡 使用 哈希计算 工具验证处理前后图像数据的完整性。
常见问题 FAQ
Q1: WebAssembly 能完全替代 JavaScript 吗?
不能。WASM 擅长计算密集型任务,但无法直接操作 DOM。2026 年最佳实践是混合架构:JS 负责 UI 交互和 DOM 操作,WASM 负责数据处理和算法计算。
Q2: 什么时候应该选择 WASM 而非 JavaScript?
- 图像/视频/音频处理
- 加密与哈希计算
- 数据压缩/解压
- 大规模数据排序/搜索
- 物理引擎/游戏逻辑
- AI 推理(ONNX Runtime WASM)
Q3: Rust 编译的 WASM 体积太大怎么办?
- 设置
opt-level = "z"+lto = true+panic = "abort" - 使用
wasm-opt -Oz后处理 - 使用
wasm-snip移除 panic 基础设施 - 启用
gzip/brotli压缩传输(WASM 压缩率极高) - 按功能拆分为多个 WASM 模块,按需加载
Q4: WASM 模块如何做版本管理和缓存?
// 基于 Content Hash 的缓存策略
const WASM_VERSION = 'v1.2.3';
const CACHE_KEY = `wasm-module-${WASM_VERSION}`;
async function loadWasmWithCache() {
const cache = await caches.open('wasm-cache');
let response = await cache.match(CACHE_KEY);
if (!response) {
response = await fetch('./wasm_perf_demo.wasm');
await cache.put(CACHE_KEY, response.clone());
}
const { instance } = await WebAssembly.instantiateStreaming(response);
return instance.exports;
}
Q5: 如何在 WASM 中处理错误?
use wasm_bindgen::prelude::*;
#[wasm_bindgen]
pub enum WasmError {
InvalidInput,
OutOfMemory,
ProcessingFailed,
}
#[wasm_bindgen]
pub fn safe_process(input: &[u8]) -> Result<Vec<u8>, WasmError> {
if input.is_empty() {
return Err(WasmError::InvalidInput);
}
if input.len() > 10 * 1024 * 1024 {
return Err(WasmError::OutOfMemory);
}
Ok(input.iter().map(|&b| b.wrapping_add(1)).collect())
}
总结与展望
WebAssembly 在 2026 年已从浏览器扩展到全栈:WASM Component Model 实现了语言无关的模块互操作,WASI 让服务端 WASM 成为容器化的轻量替代,SIMD 和多线程支持让 WASM 性能逼近原生代码。
核心要点回顾:
- 选对场景:计算密集型用 WASM,DOM 操作用 JS
- 工具链成熟:wasm-pack + wasm-bindgen 让 Rust→WASM 开发体验流畅
- 内存是关键:避免频繁跨边界数据拷贝,善用共享内存
- 并行化加速:Web Worker + WASM 充分利用多核 CPU
- 体积优化:lto + wasm-opt + brotli 压缩让加载速度可控
- SIMD 向量化:数值计算场景可获得额外 2-4 倍加速
💡 探索更多工具:Base64 编解码、JSON 格式化、哈希计算
本站提供浏览器本地工具,免注册即可试用 →
#WebAssembly#WASM#Rust#性能优化#教程