⭐ 欢迎来到虫虫下载站! | 📦 资源下载 📁 资源专辑 ℹ️ 关于我们
⭐ 虫虫下载站

📄 taskstats.txt

📁 linux 内核源代码
💻 TXT
字号:
Per-task statistics interface-----------------------------Taskstats is a netlink-based interface for sending per-task andper-process statistics from the kernel to userspace.Taskstats was designed for the following benefits:- efficiently provide statistics during lifetime of a task and on its exit- unified interface for multiple accounting subsystems- extensibility for use by future accounting patchesTerminology-----------"pid", "tid" and "task" are used interchangeably and refer to the standardLinux task defined by struct task_struct.  per-pid stats are the same asper-task stats."tgid", "process" and "thread group" are used interchangeably and refer to thetasks that share an mm_struct i.e. the traditional Unix process. Despite theuse of tgid, there is no special treatment for the task that is thread groupleader - a process is deemed alive as long as it has any task belonging to it.Usage-----To get statistics during a task's lifetime, userspace opens a unicast netlinksocket (NETLINK_GENERIC family) and sends commands specifying a pid or a tgid.The response contains statistics for a task (if pid is specified) or the sum ofstatistics for all tasks of the process (if tgid is specified).To obtain statistics for tasks which are exiting, the userspace listenersends a register command and specifies a cpumask. Whenever a task exits onone of the cpus in the cpumask, its per-pid statistics are sent to theregistered listener. Using cpumasks allows the data received by one listenerto be limited and assists in flow control over the netlink interface and isexplained in more detail below.If the exiting task is the last thread exiting its thread group,an additional record containing the per-tgid stats is also sent to userspace.The latter contains the sum of per-pid stats for all threads in the threadgroup, both past and present.getdelays.c is a simple utility demonstrating usage of the taskstats interfacefor reporting delay accounting statistics. Users can register cpumasks,send commands and process responses, listen for per-tid/tgid exit data,write the data received to a file and do basic flow control by increasingreceive buffer sizes.Interface---------The user-kernel interface is encapsulated in include/linux/taskstats.hTo avoid this documentation becoming obsolete as the interface evolves, onlyan outline of the current version is given. taskstats.h always overrides thedescription here.struct taskstats is the common accounting structure for both per-pid andper-tgid data. It is versioned and can be extended by each accounting subsystemthat is added to the kernel. The fields and their semantics are defined in thetaskstats.h file.The data exchanged between user and kernel space is a netlink message belongingto the NETLINK_GENERIC family and using the netlink attributes interface.The messages are in the format    +----------+- - -+-------------+-------------------+    | nlmsghdr | Pad |  genlmsghdr | taskstats payload |    +----------+- - -+-------------+-------------------+The taskstats payload is one of the following three kinds:1. Commands: Sent from user to kernel. Commands to get data ona pid/tgid consist of one attribute, of type TASKSTATS_CMD_ATTR_PID/TGID,containing a u32 pid or tgid in the attribute payload. The pid/tgid denotesthe task/process for which userspace wants statistics.Commands to register/deregister interest in exit data from a set of cpusconsist of one attribute, of typeTASKSTATS_CMD_ATTR_REGISTER/DEREGISTER_CPUMASK and contain a cpumask in theattribute payload. The cpumask is specified as an ascii string ofcomma-separated cpu ranges e.g. to listen to exit data from cpus 1,2,3,5,7,8the cpumask would be "1-3,5,7-8". If userspace forgets to deregister interestin cpus before closing the listening socket, the kernel cleans up its interestset over time. However, for the sake of efficiency, an explicit deregistrationis advisable.2. Response for a command: sent from the kernel in response to a userspacecommand. The payload is a series of three attributes of type:a) TASKSTATS_TYPE_AGGR_PID/TGID : attribute containing no payload but indicatesa pid/tgid will be followed by some stats.b) TASKSTATS_TYPE_PID/TGID: attribute whose payload is the pid/tgid whose statsare being returned.c) TASKSTATS_TYPE_STATS: attribute with a struct taskstats as payload. Thesame structure is used for both per-pid and per-tgid stats.3. New message sent by kernel whenever a task exits. The payload consists of a   series of attributes of the following type:a) TASKSTATS_TYPE_AGGR_PID: indicates next two attributes will be pid+statsb) TASKSTATS_TYPE_PID: contains exiting task's pidc) TASKSTATS_TYPE_STATS: contains the exiting task's per-pid statsd) TASKSTATS_TYPE_AGGR_TGID: indicates next two attributes will be tgid+statse) TASKSTATS_TYPE_TGID: contains tgid of process to which task belongsf) TASKSTATS_TYPE_STATS: contains the per-tgid stats for exiting task's processper-tgid stats--------------Taskstats provides per-process stats, in addition to per-task stats, sinceresource management is often done at a process granularity and aggregating taskstats in userspace alone is inefficient and potentially inaccurate (due to lackof atomicity).However, maintaining per-process, in addition to per-task stats, within thekernel has space and time overheads. To address this, the taskstats codeaccumulates each exiting task's statistics into a process-wide data structure.When the last task of a process exits, the process level data accumulated alsogets sent to userspace (along with the per-task data).When a user queries to get per-tgid data, the sum of all other live threads inthe group is added up and added to the accumulated total for previously exitedthreads of the same thread group.Extending taskstats-------------------There are two ways to extend the taskstats interface to export moreper-task/process stats as patches to collect them get added to the kernelin future:1. Adding more fields to the end of the existing struct taskstats. Backward   compatibility is ensured by the version number within the   structure. Userspace will use only the fields of the struct that correspond   to the version its using.2. Defining separate statistic structs and using the netlink attributes   interface to return them. Since userspace processes each netlink attribute   independently, it can always ignore attributes whose type it does not   understand (because it is using an older version of the interface).Choosing between 1. and 2. is a matter of trading off flexibility andoverhead. If only a few fields need to be added, then 1. is the preferablepath since the kernel and userspace don't need to incur the overhead ofprocessing new netlink attributes. But if the new fields expand the existingstruct too much, requiring disparate userspace accounting utilities tounnecessarily receive large structures whose fields are of no interest, thenextending the attributes structure would be worthwhile.Flow control for taskstats--------------------------When the rate of task exits becomes large, a listener may not be able to keepup with the kernel's rate of sending per-tid/tgid exit data leading to dataloss. This possibility gets compounded when the taskstats structure getsextended and the number of cpus grows large.To avoid losing statistics, userspace should do one or more of the following:- increase the receive buffer sizes for the netlink sockets opened bylisteners to receive exit data.- create more listeners and reduce the number of cpus being listened to byeach listener. In the extreme case, there could be one listener for each cpu.Users may also consider setting the cpu affinity of the listener to the subsetof cpus to which it listens, especially if they are listening to just one cpu.Despite these measures, if the userspace receives ENOBUFS error messagesindicated overflow of receive buffers, it should take measures to handle theloss of data.----

⌨️ 快捷键说明

复制代码 Ctrl + C
搜索代码 Ctrl + F
全屏模式 F11
切换主题 Ctrl + Shift + D
显示快捷键 ?
增大字号 Ctrl + =
减小字号 Ctrl + -