View Issue Details

ID: 0004491
Project: SOGo
Category: Backend General
View Status: public
Last Update: 2019-02-14 19:09
Reporter: gregk@kgbconsulting.ca
Assigned To: ludovic
Priority: normal
Severity: crash
Reproducibility: have not tried
Status: closed
Resolution: no change required
Platform: [Server] Linux
OS: Ubuntu
OS Version: 16.04 LTS
Product Version: 4.0.0
Summary: 0004491: sogod kills all children and busy loops
Description

I found two SOGo servers in this state yesterday. Here are some quick facts:

  1. only one sogod process is running
  2. sogod is consuming 100% cpu
  3. simple "kill" did not kill it. kill -9 worked
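
On point 3: a plain "kill" sends SIGTERM, which a process can catch and then carry on, while "kill -9" sends SIGKILL, which cannot be caught. The sketch below illustrates only that general POSIX behaviour. It is not sogod's code, and the spinning child is a hypothetical stand-in for a stuck process.

```python
import signal
import subprocess
import sys
import time

# Hypothetical stand-in for a stuck process: it catches SIGTERM, records it,
# and keeps spinning in a pure userspace loop (no system calls).
CHILD_SOURCE = r"""
import signal
seen = []
signal.signal(signal.SIGTERM, lambda signum, frame: seen.append(signum))
print("ready", flush=True)
while True:
    pass
"""


def demo():
    """Return (survived_sigterm, exit_code_after_sigkill) for the child."""
    proc = subprocess.Popen(
        [sys.executable, "-c", CHILD_SOURCE], stdout=subprocess.PIPE
    )
    try:
        # Wait until the child has installed its SIGTERM handler.
        proc.stdout.readline()

        # Equivalent of a plain "kill <pid>".
        proc.send_signal(signal.SIGTERM)
        time.sleep(0.5)
        survived_sigterm = proc.poll() is None

        # Equivalent of "kill -9 <pid>"; this cannot be caught.
        proc.kill()
        proc.wait(timeout=5)
        return survived_sigterm, proc.returncode
    finally:
        proc.stdout.close()
```

Calling demo() on Linux should show the child surviving SIGTERM and then exiting with a negative return code, which is how subprocess reports death by signal.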

The tail end of the logs looks like this (full log attached):
May 25 23:03:03 sogod [3395]: |SOGo| starting method 'GET' on uri '/SOGo'
May 25 23:03:03 sogod [3395]: |SOGo| request took 0.000622 seconds to execute
May 25 23:03:03 sogod [3395]: 10.101.16.150 "GET /SOGo HTTP/1.0" 302 0/0 0.001 -

May 25 23:03:05 sogod [3272]: <0x0x5596b0891840[WOWatchDog]> Terminating with SIGINT or SIGTERM
May 25 23:03:05 sogod [3272]: <0x0x5596b07f4950[WOWatchDogChild]> sending terminate signal to pid 3407
May 25 23:03:05 sogod [3272]: <0x0x5596b07f3270[WOWatchDogChild]> sending terminate signal to pid 3406
May 25 23:03:05 sogod [3272]: <0x0x5596b07f31c0[WOWatchDogChild]> sending termin

<snip>

MySQL4 connection dropped 0x0x5596b0f93800 (channel=0x0x5596b0ff86f0)
<data>:27: element div: validity error : ID compose-container already defined
r" itemscope="" itemtype="https://schema.org/EmailMessage&quot; style="direction:ltr"
^
<data>:39: element div: validity error : ID divRplyFwdMsg already defined
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#00000
^
MySQL4 connection dropped 0x0x5596b0f93800 (channel=0x0x5596b2256490)

< snip > < Lots more terminating processes and dropped MySQL4 connections >

May 25 23:03:24 sogod [3272]: <0x0x5596b0891840[WOWatchDog]> Terminating with SIGINT or SIGTERM
May 25 23:03:25 sogod [3272]: <0x0x5596b0891840[WOWatchDog]> Terminating with SIGINT or SIGTERM
May 25 23:03:25 sogod [3272]: <0x0x5596b0891840[WOWatchDog]> Terminating with SIGINT or SIGTERM
May 25 23:03:26 sogod [16544]: version 4.0.0 (build @shiva.inverse 201803170710) -- starting
May 25 23:03:26 sogod [16544]: vmem size check enabled: shutting down app when vmem > 500 MB. Currently at 224 MB
May 25 23:03:27 sogod [16544]: <0x0x5572d859f220[SOGoProductLoader]> SOGo products loaded from '/usr/lib/GNUstep/SOGo':
May 25 23:03:27 sogod [16544]: <0x0x5572d859f220[SOGoProductLoader]> Appointments.SOGo, AdministrationUI.SOGo, SchedulerUI.SOGo, MailPartViewers.SOGo, ActiveSync.SOGo, PreferencesUI.SOGo, MainUI.SOGo, Contacts.SOGo, Mailer.SOGo, CommonUI.SOGo, MailerUI.SOGo, ContactsUI.SOGo
May 25 23:03:27 sogod [16544]: All products loaded - current memory usage at 278 MB
May 25 23:03:27 sogod [16544]: [WARN] <0x0x5572d8705840[WOWatchDog]> listening socket: attempt 1 failed
May 25 23:03:28 sogod [16544]: [WARN] <0x0x5572d8705840[WOWatchDog]> listening socket: attempt 2 failed
May 25 23:03:29 sogod [16544]: [WARN] <0x0x5572d8705840[WOWatchDog]> listening socket: attempt 3 failed
May 25 23:03:30 sogod [16544]: [WARN] <0x0x5572d8705840[WOWatchDog]> listening socket: attempt 4 failed
May 25 23:03:31 sogod [16544]: [WARN] <0x0x5572d8705840[WOWatchDog]> listening socket: attempt 5 failed
May 25 23:03:32 sogod [16544]: [ERROR] <0x0x5572d8705840[WOWatchDog]> unable to listen on specified port, check that no other process is already using it
....
<snip> < days worth of failed restarts >
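
The "unable to listen on specified port" error is consistent with the old, spinning process still holding the listening socket, though the log alone does not prove it. One quick check is to try binding the port yourself. This is a minimal sketch, not part of SOGo: the helper name is made up for illustration, and 127.0.0.1 with port 20000 is an assumed listen address, so substitute whatever WOPort is set to in your configuration.

```python
import errno
import socket


def port_is_free(host, port):
    """Return True if a TCP socket can bind host:port right now.

    SO_REUSEADDR is deliberately not set, so lingering sockets from a
    recently closed connection may also be reported as "in use".
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        try:
            probe.bind((host, port))
        except OSError as exc:
            if exc.errno == errno.EADDRINUSE:
                return False
            # Permission errors, bad addresses, etc. are not "in use".
            raise
        return True
```

For example, port_is_free("127.0.0.1", 20000) returning False while no healthy sogod is running would point at a leftover process still bound to the port.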

Steps To Reproduce

Does "Terminating with SIGINT or SIGTERM" mean that something tried to shut it down? It does not happen on every shutdown.

TagsNo tags attached.

Activities

gregk@kgbconsulting.ca


2018-06-21 02:49

reporter  

sogo.log.26 (66,029 bytes)
gregk@kgbconsulting.ca


2018-06-21 02:52

reporter  

gbd_bt.txt (2,777 bytes)   
(gdb) bt
#0  0x00007f2aee33bcf4 in ?? () from /usr/lib/libNGStreams.so.4.9
#1  0x00007f2aee85c397 in ?? () from /usr/lib/libNGMime.so.4.9
#2  0x00007f2aee85ce01 in ?? () from /usr/lib/libNGMime.so.4.9
#3  0x00007f2aee86e846 in ?? () from /usr/lib/libNGMime.so.4.9
#4  0x00007f2aee86ee14 in ?? () from /usr/lib/libNGMime.so.4.9
#5  0x00007f2aee86b41b in ?? () from /usr/lib/libNGMime.so.4.9
#6  0x00007f2aee899e55 in ?? () from /usr/lib/libNGMime.so.4.9
#7  0x00007f2ae1e33076 in ?? () from /usr/lib/GNUstep/SOGo/Mailer.SOGo/./Mailer
#8  0x00007f2adf307faf in ?? ()
   from /usr/lib/GNUstep/SOGo/MailerUI.SOGo/./MailerUI
#9  0x00007f2adf307a0d in ?? ()
   from /usr/lib/GNUstep/SOGo/MailerUI.SOGo/./MailerUI
#10 0x00007f2adf307d90 in ?? ()
   from /usr/lib/GNUstep/SOGo/MailerUI.SOGo/./MailerUI
#11 0x00007f2aeec69fae in ?? () from /usr/lib/libNGObjWeb.so.4.9
#12 0x00007f2ae0714d73 in ?? () from /usr/lib/sogo/libSOGoUI.so.4.0
#13 0x00007f2aeecfb31f in ?? () from /usr/lib/libNGObjWeb.so.4.9
#14 0x00007f2aeecfb44b in ?? () from /usr/lib/libNGObjWeb.so.4.9
#15 0x00007f2aeecfb093 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#16 0x00007f2aeecfb44b in ?? () from /usr/lib/libNGObjWeb.so.4.9
#17 0x00007f2aeecf594c in ?? () from /usr/lib/libNGObjWeb.so.4.9
#18 0x00007f2aeecf7e0d in ?? () from /usr/lib/libNGObjWeb.so.4.9
#19 0x00007f2aeec7bd0f in ?? () from /usr/lib/libNGObjWeb.so.4.9
#20 0x00007f2aeec3c09e in ?? () from /usr/lib/libNGObjWeb.so.4.9
#21 0x00007f2aeec3c3bd in ?? () from /usr/lib/libNGObjWeb.so.4.9
#22 0x00005596b0152c69 in ?? ()
#23 0x00007f2aeece6353 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#24 0x00007f2aeece66c9 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#25 0x00007f2aeece233e in ?? () from /usr/lib/libNGObjWeb.so.4.9
#26 0x00007f2aeece255e in ?? () from /usr/lib/libNGObjWeb.so.4.9
#27 0x00007f2aeece2967 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#28 0x00007f2aeece2c4e in ?? () from /usr/lib/libNGObjWeb.so.4.9
#29 0x00007f2aed3ff0d7 in ?? () from /usr/lib/libgnustep-base.so.1.24
#30 0x00007f2aee09db23 in ?? () from /usr/lib/libNGExtensions.so.4.9
#31 0x00007f2aed506d2b in ?? () from /usr/lib/libgnustep-base.so.1.24
#32 0x00007f2aed445500 in ?? () from /usr/lib/libgnustep-base.so.1.24
#33 0x00007f2aed4459b2 in ?? () from /usr/lib/libgnustep-base.so.1.24
#34 0x00007f2aeec3b8c9 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#35 0x00005596b0151db0 in ?? ()
#36 0x00007f2aeec853b4 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#37 0x00007f2aeec85860 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#38 0x00007f2aeec85f98 in ?? () from /usr/lib/libNGObjWeb.so.4.9
#39 0x00007f2aeec8702d in ?? () from /usr/lib/libNGObjWeb.so.4.9
#40 0x00007f2aeec8777d in WOWatchDogApplicationMain ()
   from /usr/lib/libNGObjWeb.so.4.9
#41 0x00005596b0150a79 in main ()
(gdb) quit

gregk@kgbconsulting.ca


2018-06-21 02:52

reporter   ~0012920

strace of the spinning sogod process shows no system calls.
The gdb backtrace is attached, but it is likely useless since gdb found no symbols.

ludovic


2019-02-14 19:09

administrator   ~0013356

Likely due to a broken IMAP connection. The child, even if hung, would be killed and respawned.

Issue History

Date Modified Username Field Change
2018-06-21 02:49 gregk@kgbconsulting.ca New Issue
2018-06-21 02:49 gregk@kgbconsulting.ca File Added: sogo.log.26
2018-06-21 02:52 gregk@kgbconsulting.ca File Added: gbd_bt.txt
2018-06-21 02:52 gregk@kgbconsulting.ca Note Added: 0012920
2019-02-14 19:09 ludovic Note Added: 0013356
2019-02-14 19:09 ludovic Status new => closed
2019-02-14 19:09 ludovic Assigned To => ludovic
2019-02-14 19:09 ludovic Resolution open => no change required