I have researched my scenario everywhere but can't find any string related to my issue. I have a datanode in Hadoop Framework , which recently went bad because all the drives on that box got umounted due to some unknown reason. These drives are mounted on directories that reside on the "/" . since the hadoop processes were still running it was writing to these directories but after the drives got unmounted it consumed all the space on root instead of separate drives which they were mounted on, so the root became full stopped the hadoop related services due to unavailability of space. Now that I mounted all the drives back and cleaned all the old data on them , my root is still showing 100%. Here is the situation:
Filesystem Size Used Avail Use% Mounted on
/dev/sda2 3.6T 3.4T 140M 100% /
tmpfs 24G 0 24G 0% /dev/shm
/dev/sda1 239M 60M 167M 27% /boot
/dev/sdb1 3.6T 15G 3.4T 1% /data-1
/dev/sdc1 3.6T 16G 3.4T 1% /data-2
/dev/sdd1 3.6T 16G 3.4T 1% /data-3
/dev/sde1 3.6T 16G 3.4T 1% /data-4
/dev/sdf1 3.6T 15G 3.4T 1% /data-5
/dev/sdg1 3.6T 15G 3.4T 1% /data-6
/dev/sdh1 3.6T 16G 3.4T 1% /data-7
/dev/sdi1 3.6T 15G 3.4T 1% /data-8
/dev/sdj1 3.6T 15G 3.4T 1% /data-9
/dev/sdk1 3.6T 15G 3.4T 1% /data-10
/dev/sdl1 3.6T 16G 3.4T 1% /data-11
cm_processes 24G 512K 24G 1% /var/run/cloudera-scm-agent/process
I have read all the threads about process still writing to the old dir but it does not imply in my case.
[root@server /]# du -sh ./*
7.7M ./bin
58M ./boot
15G ./data-1
15G ./data-10
16G ./data-11
16G ./data-2
16G ./data-3
15G ./data-4
15G ./data-5
15G ./data-6
16G ./data-7
15G ./data-8
15G ./data-9
264K ./dev
30M ./etc
18M ./files
132K ./home
260M ./lib
23M ./lib64
16K ./lost+found
4.0K ./media
4.0K ./mnt
3.7G ./opt
du: cannot access `./proc/19763/task/19763/fd/4': No such file or directory
du: cannot access `./proc/19763/task/19763/fdinfo/4': No such file or directory
du: cannot access `./proc/19763/fd/4': No such file or directory
du: cannot access `./proc/19763/fdinfo/4': No such file or directory
0 ./proc
112K ./root
14M ./sbin
4.0K ./selinux
4.0K ./srv
0 ./sys
176K ./tmp
2.2G ./usr
808M ./var
[root@server /]# lsof | grep 'deleted'
This command returns nothing. Also recycled the server but no effect. Thanks for your help.