- How to determine, whether a file is open?
- How to Check Open Files in Linux
- Pre-Requisites
- LSOF Utility
- How to Install lsof on Debian/Ubuntu
- How to Install on REHL/CentOS
- How to Install on Arch
- How to Install on Fedora
- Basic lsof Usage
- How to Show Processes that Opened a File
- How Show files Opened by a Specific User
- How to Show Files Opened by a Specific Process
- How to Show Files Opened in a Directory
- How to Show Network Connection
- How to Continuously Show Files
- Conclusion
- About the author
- John Otieno
- How to check if a file is opened in Linux?
- 4 Answers 4
How to determine, whether a file is open?
My code needs to go through files in a directory, picking only those, which are currently opened (for writing) by any other process on the system. The ideal solution would apply for all Unixes, but I’ll settle for a Linux-only. The program is written in Python, but I can add a custom C-function, if I have to — I just need to know, what API is available for this. One suggestion I found was to go through all file-descriptors under Linux /proc , resolving their links to see, if they point at the file of interest. But that seems rather heavy. I know, for example, that opening a file increases its reference count — filesystem will not deallocate blocks of an opened file even if it is deleted — until it is closed — the feature relied upon by tmpfile(3) . Perhaps, a user process can get access to these records in the kernel?
Yeah, lsof — and fuser — scan /proc . But that yields more information than I need — I don’t care, which processes have the file open. I just want to know, whether any such exist. Perhaps, this information can be obtained more cheaply, than /proc rescan?
The advantage of scanning /proc is that it is backed by direct kernel calls, not a physical file system. That gives /proc a huge performance advantage over opening and reading a directory, even just to find the names.
The advantage of scanning /proc is that it is the only way to get the information without modifying the kernel.
How to Check Open Files in Linux
You may have come across the saying, “Everything is a file in Linux.” Although this is not entirely true, it does hold a set of truths to it.
In Linux and Unix-like systems, everything is like a file. That means the resources in the Unix system get assigned a file descriptor, including storage devices, network sockets, processes, etc.
A file descriptor is a unique number that identifies a file and other input/output devices. It describes resources and how the kernel accesses them. Think of it as a gateway to the Kernel abstraction hardware resources.
Unfortunately, the concept of file descriptors is beyond the scope of this tutorial; consider the link provided below to get started on learning more:
That means that Unix and Unix-like systems such as Linux use such files heavily. As a Linux power user, seeing the open files and the process and users using them is incredibly useful.
This tutorial will focus on ways to view the files open and which process or user is responsible.
Pre-Requisites
Before we begin, ensure that you have:
If you have these, let us get started:
LSOF Utility
Created by Victor A Abell, List open files, or lsof for short, is a command-line utility that allows us to view the open files and the processes or users who opened them.
The lsof utility is available in major Linux distributions; however, you may find it not installed and thus may need to install manually.
How to Install lsof on Debian/Ubuntu
To install it on Debian, use the command:
sudo apt-get install lsof -y
How to Install on REHL/CentOS
To install on REHL and CentOS, use the command:
How to Install on Arch
On Arch, call the package manager using the command:
How to Install on Fedora
On Fedora, use the command:
Once you have the lsof utility installed and updated, we can begin using it.
Basic lsof Usage
To use the lsof tool, enter the command:
Once you execute the above command, lsof will dump a lot of information as shown below:
The above output shows all the files opened by the processes. The output has various columns, each representing specific information about the file.
- The COMMAND column – shows the name of the process that is using the file.
- PID – shows the Process Identifier of the process using the file.
- The TID – Shows the task ID (threads) of the process.
- TASKCMD – Represent the name of the task command.
- USER – The owner of the process.
- FD – Shows the file descriptor number. This is how processes use the file; the options available in this column output include:
- cwd – current working directory.
- mem – memory-mapped file
- pd – parent directory
- jld – jail directory
- ltx – shared library text
- rtd – root directory.
- txt – program code and data
- tr – kernel trace file.
- err – File descriptor information error
- mmp – Memory-mapped device.
- TYPE – Shows the type of node associated with the file, such as:
- Unix – for Unix domain socket.
- DIR – represents the directory
- REG – representing the regular file
- CHR – represents the special character file.
- LINK – symbolic link file
- BLK – Block special file
- INET – Internet domain socket
- FIFO – a named pipe (First In First Out file)
- PIPE – for pipes
- DEVICES – Shows the device numbers separated by commas in the order of special character file, block special, regular, directory, and NFS file.
- SIZE/OFF – shows the size of the file pr file offset in bytes.
- NODE – shows the node number of the local file, type for internet protocol type, etc.
- NAME – shows the name of the mount point and fs on which the file is located.
Note: Please Refer to the lsof Manual for detailed information on the columns.
How to Show Processes that Opened a File
Lsof provides us with options that help us filter the output to show only the processes that opened a specific file.
For example, to see the file that opened the file /bin/bash, use the command as:
This will give you an output as shown below:
COMMAND PID USER FD TYPE DEVICE SIZE / OFF NODE NAME
ksmtuned 1025 root txt REG 253 , 0 1150704 428303 / usr / bin / bash
bash 2968 centos txt REG 253 , 0 1150704 428303 / usr / bin / bash
bash 3075 centos txt REG 253 , 0 1150704 428303 / usr / bin / bash
How Show files Opened by a Specific User
We can also filter the output to show the files opened by a specific user. We do this by using the -u flag followed by the username as:
This will give you an output as shown below:
How to Show Files Opened by a Specific Process
Suppose we want to view all the files opened by a specific process? For this, we can use the PID of the process to filter the output.
For example, the below command shows the files opened by bash.
This will give you only the files opened by systemd as shown:
How to Show Files Opened in a Directory
To get the files opened in a specific directory, we can pass the +D option followed by the directory path.
For example, list open files in the /etc directory.
Below is the output for this:
How to Show Network Connection
Since everything in Linux is a file, we can get the network files such as TCP files or connections.
This will give you the TCP connections in the system.
You can also filter by the specific port using the command shown below:
This will give you the output as shown below:
How to Continuously Show Files
Lsof provides us with a mode to loop the output every few seconds. This allows you to monitor the files opened by a process or user continuously.
This option, however, requires you to terminate the process manually.
For example, the command below continuously monitors the files opened on port 22:
As you can see, in the third loop, lsof catches the established connection to the server on SSH.
Conclusion
Lsof is an incredibly useful utility. It allows you to monitor for critical files as well as monitor users and processes opening files. This can be incredibly useful when troubleshooting or looking for malicious attempts to the system.
As shown in this tutorial, using various examples and methods, you can combine the functionality provided by the lsof tool for custom monitoring.
Thank you for reading and sharing! I hope you learned something new!
About the author
John Otieno
My name is John and am a fellow geek like you. I am passionate about all things computers from Hardware, Operating systems to Programming. My dream is to share my knowledge with the world and help out fellow geeks. Follow my content by subscribing to LinuxHint mailing list
How to check if a file is opened in Linux?
The thing is, I want to track if a user tries to open a file on a shared account. I’m looking for any record/technique that helps me know if the concerned file is opened, at run time. I want to create a script which monitors if the file is open, and if it is, I want it to send an alert to a particular email address. The file I’m thinking of is a regular file. I tried using lsof | grep filename for checking if a file is open in gedit, but the command doesn’t return anything. Actually, I’m trying this for a pet project, and thus the question.
4 Answers 4
The command lsof -t filename shows the IDs of all processes that have the particular file opened. lsof -t filename | wc -w gives you the number of processes currently accessing the file.
I would recommend wc -l instead of wc -w which works in more cases and lsof is not going to put more than one filename per line.
The fact that a file has been read into an editor like gedit does not mean that the file is still open. The editor most likely opens the file, reads its contents and then closes the file. After you have edited the file you have the choice to overwrite the existing file or save as another file.
You could (in addition of other answers) use the Linux-specific inotify(7) facilities.
I am understanding that you want to track one (or a few) particular given file, with a fixed file path (actually a given i-node). E.g. you would want to track when /var/run/foobar is accessed or modified, and do something when that happens
In particular, you might want to install and use incrond(8) and configure it thru incrontab(5)
If you want to run a script when some given file (on a native local, e.g. Ext4, BTRS, . but not NFS file system) is accessed or modified, use inotify incrond is exactly done for that purpose.
PS. AFAIK, inotify don’t work well for remote network files, e.g. NFS filesystems (in particular when another NFS client machine is modifying a file).
If the files you are fond of are somehow source files, you might be interested by revision control systems (like git) or builder systems (like GNU make); in a certain way these tools are related to file modification.
You could also have the particular file system sits in some FUSE filesystem, and write your own FUSE daemon.
If you can restrict and modify the programs accessing the file, you might want to use advisory locking, e.g. flock(2), lockf(3).
Perhaps the data sitting in the file should be in some database (e.g. sqlite or a real DBMS like PostGreSQL ou MongoDB). ACID properties are important .
Notice that the filesystem and the mount options may matter a lot.
You might want to use the stat(1) command.
It is difficult to help more without understanding the real use case and the motivation. You should avoid some XY problem
Probably, the workflow is wrong (having a shared file between several users able to write it), and you should approach the overall issue in some other way. For a pet project I would at least recommend using some advisory lock, and access & modify the information only thru your own programs (perhaps setuid) using flock (this excludes ordinary editors like gedit or commands like cat . ). However, your implicit use case seems to be well suited for a DBMS approach (a database does not have to contain a lot of data, it might be tiny), or some index locked file like GDBM library is handling.
Remember that on POSIX systems and Linux, several processes can access (and even modify) the same file simultaneously (unless you use some locking or synchronization).
Reading the Advanced Linux Programming book (freely available) would give you a broader picture (but it does not mention inotify which appeared aften the book was written).