分类 technique 下的文章

MPI && OMP 相关

作者: leenldk
时间: 2024-06-11
分类: technique

CPU 结构：
threads per core ：超线程
cores per socket ：每个 socket 核数
sockets

MPI

每个 mpi 进程有 affinity mask，长度为 CPU cores
--bind-to core : affinity mask 中只有对应 core 一位被 set
--bind-to socket : affinity mask 中 socket 对应所有 core 被 set
--bind-to none

--map-by node
--map-by socket
--map-by node:PE=8 : PE为每个进程分配的物理核数

多机

hostfile :

i1 slots=2 max-slots=8
i2 slots=2 max-slots=8

`which mpirun`  -np 2 --host i1:1,i2:1 hostname
`which mpirun` -np 4 --hostfile ./hostfile hostname

使用脚本时开头要加 #!/bin/bash

OMP

OMP_DISPLAY_ENV=true 输出 OMP 绑定情况
OMP_PLACES=threads, OMP_PLACES=cores,
OMP_PLACES=sockets

hyperthread cpu 分布：/sys/devices/system/cpu/cpu0/topology$ cat thread_siblings_list

#include <omp.h>
#include <sched.h>

    #pragma omp parallel 
    {
        int id = omp_get_thread_num();
        int max_threads = omp_get_num_threads();
        int cpuid = sched_getcpu();
        printf("hello from cpu: %d thread: %d out of %d threads @ rank = %d\n", cpuid, id, max_threads, rank);
    }

数学库相关

作者: leenldk
时间: 2023-11-05
分类: technique

BLAS : basic linear algebra subprograms (low level)
LAPACK : linear algebra package (high level，在 BLAS 之上)
ScaLAPACK : scalable LAPACK，支持 MPI 的 LAPACK

Intel MKL : 包含 BLAS, LAPACK, ScaLAPACK

ELPA : eigenvalue solvers for petaflop applications

cuBLAS

使用 cublasCreate 创建 cuBLAS library context 的 handle，
一个 context 关联一个 device
cuBLAS library context 绑定 CUDA context

shell工具

作者: leenldk
时间: 2023-04-22
分类: technique

用来记一些 tmux 之类工具的奇妙用法

tmux

ctrl+b s : 切换panel
Ctrl+b :set synchronize-panes : 多窗口同步输入

legacy 体系：

SysVinit :
初始化脚本位于 /etc/init.d/，初始化脚本需要支持 start, stop, status 等命令
使用 service 命令运行初始化脚本
SysVinit 只启动一次，在之后不跟踪单个服务

新体系：

systemd 用于代替 SysVinit ，在初始化结束时， systemd 仍然作为守护进程运行。
使用 systemctl 与 systemd 交互，运行服务。

sudo systemctl enable [service] 开机启动服务
sudo systemctl disable [service] 不开机启动服务
sudo systemctl start [service]
sudo systemctl stop [service]
sudo systemctl restart [service]

systemd

systemd 是 boot 阶段第一个运行的 userspace 程序，在 kernel 初始化硬件后马上获得控制
systemd 通过 unit configuration file 管理资源
unit 包括 .service, .mount, .device 等文件
unit 保存在 /etc/systemd/system, /lib/systemd/system 等目录
/etc/systemd/system 优先级最高
通过 systemctl 与 systemd 交互
systemd 与 systemctl 之间通过 D-Bus 交互

journald 收集和管理 systemd 的 log
通过 journalctl 查询 log