thr3ads.net - Ocfs2 users - [Ocfs2-users] ls [Apr 2015]

If this information is useful, please help other people find it:
Share via:

leopoldo tosi

2015-Apr-02 13:41 UTC

[Ocfs2-users] ls

OS ubuntu
ub-ocfs1 and ub-ocfs:~# uname -a
Linux ub-ocfs 3.5.0-17-generic #28-Ubuntu SMP Tue Oct 9 19:31:23 UTC 2012 x86_64
x86_64 x86_64 GNU/Linux

on both:

ub-ocfs:~# service o2cb start
Loading stack plugin "user": OK
Loading filesystem "ocfs2_dlmfs": OK
Mounting ocfs2_dlmfs filesystem at /dlm: OK
Setting cluster stack "cman": OK
Loading filesystem "ocfs2": OK
Starting ocfs2_controld.cman: OK

ub-ocfs:~# mount /dev/drbd/by-res/wwwdata /opt/

ub-ocfs:~# service drbd status
drbd driver loaded OK; device status:
version: 8.3.13 (api:88/proto:86-96)
srcversion: 697DE8B1973B1D8914F04DB 
m:res      cs         ro               ds                 p  mounted  fstype
1:wwwdata  Connected  Primary/Primary  UpToDate/UpToDate  C  /opt     ocfs2

ub-ocfs:~# ls /opt/
local  lost+found

ub-ocfs:~# ls /opt/local
Segmentation fault

ub-ocfs:~# ps fa
  PID TTY      STAT   TIME COMMAND
 2472 pts/2    Ss     0:00 -bash
 2529 pts/2    R+     0:00  \_ ps fa
 2299 pts/1    Ss     0:00 -bash
 2356 pts/1    S+     0:00  \_ -bash
 2361 pts/1    D+     0:00      \_ ls -l /opt/local/etc /opt/local/games
/opt/local/include /opt/local/leo1....


the script ls -l /opt/local/*

ub-ocfs:~# cat /opt/local/leo-ocfs6.sh
date ; echo -n
hostname
ls -l
ls -l /opt/local/le*
ls -l /opt/local/*


I can work normal

ub-ocfs1:~# rm -r /opt/local/bin
ub-ocfs1:~# mkdir /opt/local/etc/bin
ub-ocfs1:~# ls /opt/local/leo*
/opt/local/leo1  /opt/local/leo3  /opt/local/leo-ocfs   /opt/local/leo-ocfs2 
/opt/local/leo-ocfs4  /opt/local/leo-ocfs6
/opt/local/leo2  /opt/local/leo4  /opt/local/leo-ocfs1  /opt/local/leo-ocfs3 
/opt/local/leo-ocfs5  /opt/local/leo-ocfs6.sh


but resource is busy if I want restart or unload

ocfs:~# service o2cb unload 
Stopping ocfs2_controld.cman: /sbin/start-stop-daemon: warning: this system is
not able to track process names
longer than 15 characters, please use --exec instead of --name.
Failed
Unable to unload modules as the cluster is still online


because '2361 pts/1    D+     0:00      \_ ls -l /opt/local/' is in 
'D    uninterruptible sleep (usually IO)'

if I want the control back I need reboot host, 

anybody have another solution ?-:-(

by 
? leopoldo tosi

Zhen Ren

2015-Apr-03 02:38 UTC

head link

[Ocfs2-users] ls

Hi leopoldo,


--
Best regards,
Eric, Ren
HA team, SUSE

>>> 
>  
> OS ubuntu 
> ub-ocfs1 and ub-ocfs:~# uname -a 
> Linux ub-ocfs 3.5.0-17-generic #28-Ubuntu SMP Tue Oct 9 19:31:23 UTC 2012
x86_64
> x86_64 x86_64 GNU/Linux 
>  
> on both: 
>  
> ub-ocfs:~# service o2cb start 
> Loading stack plugin "user": OK 
> Loading filesystem "ocfs2_dlmfs": OK 
> Mounting ocfs2_dlmfs filesystem at /dlm: OK 
> Setting cluster stack "cman": OK 
> Loading filesystem "ocfs2": OK 
> Starting ocfs2_controld.cman: OK 
>  
> ub-ocfs:~# mount /dev/drbd/by-res/wwwdata /opt/ 
>  
> ub-ocfs:~# service drbd status 
> drbd driver loaded OK; device status: 
> version: 8.3.13 (api:88/proto:86-96) 
> srcversion: 697DE8B1973B1D8914F04DB  
> m:res      cs         ro               ds                 p  mounted 
fstype
> 1:wwwdata  Connected  Primary/Primary  UpToDate/UpToDate  C  /opt     ocfs2
the ds "Primary/Primary" is in question.

It seems that DRBD  cannot work when both nodes is primary.

This doc may help you get some hint.
https://www.suse.com/documentation/sle_ha/book_sleha/data/sec_ha_drbd_configure.html

Also,inproper configuration might lead to this issue.
>  
> ub-ocfs:~# ls /opt/ 
> local  lost+found 
>  
> ub-ocfs:~# ls /opt/local 
> Segmentation fault 
IIRC,I also got seg-fault problem for some unclear reason,which may be caused by
setting up problem.
>  
> ub-ocfs:~# ps fa 
>   PID TTY      STAT   TIME COMMAND 
>  2472 pts/2    Ss     0:00 -bash 
>  2529 pts/2    R+     0:00  \_ ps fa 
>  2299 pts/1    Ss     0:00 -bash 
>  2356 pts/1    S+     0:00  \_ -bash 
>  2361 pts/1    D+     0:00      \_ ls -l /opt/local/etc /opt/local/games  
> /opt/local/include /opt/local/leo1.... 
>  
>  
> the script ls -l /opt/local/* 
You mean ls -l /opt/local go bad, but ls -l /opt/local/* can work well?
>  
> ub-ocfs:~# cat /opt/local/leo-ocfs6.sh 
> date ; echo -n 
> hostname 
> ls -l 
> ls -l /opt/local/le* 
> ls -l /opt/local/* 
>  
>  
> I can work normal 
what do you mean by work normal? ;-)
other cmd can work, except ls -l /opt/local, though seg-fault already happened? 
>  
> ub-ocfs1:~# rm -r /opt/local/bin 
> ub-ocfs1:~# mkdir /opt/local/etc/bin 
> ub-ocfs1:~# ls /opt/local/leo* 
> /opt/local/leo1  /opt/local/leo3  /opt/local/leo-ocfs  
/opt/local/leo-ocfs2
> /opt/local/leo-ocfs4  /opt/local/leo-ocfs6 
> /opt/local/leo2  /opt/local/leo4  /opt/local/leo-ocfs1 
/opt/local/leo-ocfs3
> /opt/local/leo-ocfs5  /opt/local/leo-ocfs6.sh 
>  
>  
> but resource is busy if I want restart or unload 
>  
don't worry about this.
when the previous problem done, this would go away.
> ocfs:~# service o2cb unload  
> Stopping ocfs2_controld.cman: /sbin/start-stop-daemon: warning: this system
is
> not able to track process names 
> longer than 15 characters, please use --exec instead of --name. 
> Failed 
> Unable to unload modules as the cluster is still online 
>  
>  
> because '2361 pts/1    D+     0:00      \_ ls -l /opt/local/' is in
'D
> uninterruptible sleep (usually IO)' 
>  
> if I want the control back I need reboot host,  
>  
> anybody have another solution ?-:-( 
hm,what's your solution? ;-)
>  
> by  
>   leopoldo tosi 
>  
> _______________________________________________ 
> Ocfs2-users mailing list 
> Ocfs2-users at oss.oracle.com 
> https://oss.oracle.com/mailman/listinfo/ocfs2-users 
>  
>

Ocfs2 users - Apr 2015 - ls

[Ocfs2-users] ls

[Ocfs2-users] ls