RE: Re: FLUKA crash

From: Chris Theis <Christian.Theis_at_cern.ch>
Date: Thu, 4 Dec 2008 16:57:28 +0100

Dear Noemi,

thanks for the info. This matches exactly what I had already expected.
In order to check whether the graceful execution of your job on the Xeon
was "Monte Carlo coincidence" you could check the sequence of the random
numbers and compare it to the AMD based system.

As I've seen the crashes are closely related to the execution
architecture and not the one of the machine where the executable was
linked or even the whole source was compiled. One has to keep in mind
that the pipeling and prefetch mechanisms of the AMD CPUs is
significantly different from the Intel machines where we have never seen
this behavior so far. It seems that only AMD cores based on the newer
K8/K10 architecture, including their new FPU generation, are affected as
we have been running FLUKA on K7 cores (like the Athlon) without any
problem. In case I find a more satisfying resolution than changing to a
64 bit OS I will let you know.

Cheers
Chris

----------------------------------------------------------------------------
Chris Theis
CERN/SC-RP - European Organization for Nuclear Research
1211 Geneva 23, Switzerland
Phone: +41 22 767 8069 Office: 892-2A-015
e-mail: Christian.Theis_at_cern.ch www.cern.ch/theis
----------------------------------------------------------------------------

-----Original Message-----
From: Finetti Noemi [mailto:noemi.finetti_at_aquila.infn.it]
Sent: Thu 04.12.2008 15:31
To: Chris Theis
Cc: fluka-discuss_at_fluka.org
Subject: {Disarmed} Re: FLUKA crash
=20
Hi Chris,
I have solved the problem of the crash (or ... I wish so) running the=20
batch job on a machine with Linux - Scientific Linux 3.0.5 (32 bit) and=20
2 Intel(R) Xeon(TM) CPU - 2.40GHz! I point out that the executable was=20
compiled on a machine with Linux - Scientific Linux 3.0.5 (32 bit) and 2 =

AMD Opteron(tm) Processor 244 - 1.8GHz. It seams that the problem is=20
exclusively due to the machine employed for running the batch job: on=20
the machine with Linux - Scientific Linux 3.0.5 (32 bit) and 2 Intel(R)=20
Xeon(TM) CPU - 2.40GHz the run ended with success (I hope it was not a=20
... random event!).
Cheers,
noemi

Chris Theis wrote:

>Hi Noemi,
>
>the symptoms you describe are identical to a case I was dealing with
>just a couple of weeks ago. It seems that these mysterious FLUKA =
crashes
>occur on AMD chips with K8/K10 core architectures running under a 32 =
bit
>linux. This is independent of the distribution as I tried several
>different ones, with different math libs etc. Currently, the reason is
>not yet fully clear so I would suggest to follow the steps that Paola
>described in her mail from yesterday as it would be helpful to pinpoint
>the problem to a specific scenario.
>
>In the meantime I can only offer a remedy which solved this problem in
>our case. If you have the possibility you should try to switch to a 64
>bit version of linux. I ran your input on such a machine we previously
>had problems with and which is now running the 64 bit version of linux.
>Your input finished gracefully following exactly the same random number
>sequence like on your machine, just as expected.
>
>All 32 bit programs will be running under the 64 bit OS as well but you
>might need to install a couple of additional libraries for FLUKA to
>work. Some hints can be found in the following presentation of a
>colleague which could be helpful:
>
>http://info-fluka-discussion.web.cern.ch/info-fluka-discussion/talks/Lud=
ovic_From32To64_190407.ppt
>
>I saw that you're sending your jobs to a LSF batch system and probably
>cannot change the OS there. In that case you might try to investigate
>the possibilities to exclude certain architectures (I'd suggest AMD
>Opteron and Phenom CPUs) from the scheduling process.
>Cheers
>
>Chris
>
>
>From: Finetti Noemi [mailto:noemi.finetti_at_aquila.infn.it]=3D20
>Sent: 03 December 2008 10:34
>To: Chris Theis
>Subject: Re: FLUKA crash
>
>=3D20
>
>Hi Christian,
>1) I am using a 32 bit version of SLC3;
>2) when I run my FLUKA job the crash on the batch machines occurs at
>differnt points (sometimes it occurs earlier and sometimes later).
>3) -bash-3.1$ more /proc/cpuinfo
>processor : 0
>vendor_id : AuthenticAMD
>cpu family : 15
>model : 5
>model name : AMD Opteron(tm) Processor 244
>stepping : 10
>cpu MHz : 1800.000
>cache size : 1024 KB
>fdiv_bug : no
>hlt_bug : no
>f00f_bug : no
>coma_bug : no
>fpu : yes
>fpu_exception : yes
>cpuid level : 1
>wp : yes
>flags : fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca
>cmov pat
>pse36 clflush mmx fxsr sse sse2 syscall nx mmxext lm 3dnowext 3dnow ts
>fid vid ttp
>bogomips : 3617.79
>=3D20
>processor : 1
>vendor_id : AuthenticAMD
>cpu family : 15
>model : 5
>model name : AMD Opteron(tm) Processor 244
>stepping : 10
>cpu MHz : 1800.000
>cache size : 1024 KB
>fdiv_bug : no
>hlt_bug : no
>f00f_bug : no
>coma_bug : no
>fpu : yes
>fpu_exception : yes
>cpuid level : 1
>wp : yes
>flags : fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca
>cmov pat
>pse36 clflush mmx fxsr sse sse2 syscall nx mmxext lm 3dnowext 3dnow ts
>fid vid ttp
>bogomips : 3616.16
>
>Thanks in advance,
>noemi
>
>
>Chris Theis wrote:
>
>
>
>Hi Noemi,
>=3D20
>I'm currently looking into your problem and I would need some more
>information. Could you please answer the following questions which =
would
>help me to check whether this problem is similar to one that I have =
seen
>just recently:
>=3D20
>- Are you using the 32 bit or the 64 bit version of SLC3?
>=3D20
>- When you run your FLUKA job does the crash on the batch machines
>always occur at the same
> point or does it sometimes occur earlier and sometimes later?
>=3D20
>- Could you please run the following command on the execution PC and
>send me the output
> "cat /proc/cpuinfo"
>=3D20
>Ciao
>Chris
>=3D20
>------------------------------------------------------------------------=

>Chris Theis
>CERN/SC-RP - European Organization for Nuclear Research
>1211 Geneva 23, Switzerland
>Phone: +41 22 767 8069 Office: 892-2A-015
>e-mail: Christian.Theis@cern.ch www: http://www.cern.ch/theis
>------------------------------------------------------------------------=

>=3D20
>=3D20
> =3D20
>
> -----Original Message-----
> From: owner-fluka-discuss_at_mi.infn.it [mailto:owner-fluka-
> discuss_at_mi.infn.it] On Behalf Of Finetti Noemi
> Sent: 27 November 2008 15:31
> To: fluka-discuss_at_fluka.org
> Subject: FLUKA crash
> =3D20
> Hi all,
> I have installed FLUKA 2008.3 on a linux machine (Model: 2 AMD
>Opteron
> (tm) Processor 244 - 1.8GHz; Operating system: Linux -
>Scientific
> =3D20
>
>Linux
> =3D20
>
> 3.0.5; with g77) where I have compiled my executable file
>(myfluka)
> which calls the user routines USRINI, HISTIN,SOURCE, USRMED,
>USROUT
> (see user_routines.tar.gz). Executing the job in batch (for
>399528
> primaries) the run crashed while running the same job
>interactively
> (for 100
> primaries) every thing was fine. What could be the reasons?
> =3D20
> I point out that the batch job was executed on a 2 Dual Core AMD
> Opteron(tm) Processor 280 - 2.4 GHz with Linux - Scientific
>Linux
> 3.0.8.
> =3D20
> In attachment the LSF message, the gdb output (file
>fluka_gdb.out),
> =3D20
>
>the
> =3D20
>
> files .err, .log, .out, fort.1 and fort.2.
> =3D20
> Thanks in advance,
> noemi
> =3D20
> --
>=3D09
>---------------------------------------------------------------------
> * Address: Dott.ssa Noemi Finetti
> c/o Dipartimento di Fisica dell'Universita' degli Studi
>dell'Aquila
> Via Vetoio - 67010 Coppito - L'Aquila - Italy
> * Phone: +39-0862-433051 (Office); +39-0862-433043 (Laboratory)
> * Fax: +39-0862-433033 (Department).
>=3D09
>---------------------------------------------------------------------
> =3D20
> =3D20
> =3D20
> =3D20
>
>=3D20
>=3D20
> =3D20
>
>
>
>
>
>--=3D20
>---------------------------------------------------------------------
>* Address: Dott.ssa Noemi Finetti
> c/o Dipartimento di Fisica dell'Universita' degli Studi dell'Aquila
> Via Vetoio - 67010 Coppito - L'Aquila - Italy
>* Phone: +39-0862-433051 (Office); +39-0862-433043 (Laboratory)
>* Fax: +39-0862-433033 (Department).
>---------------------------------------------------------------------
>=3D20
>=3D20
>
>------_=3D_NextPart_001_01C95537.0C95CDE5
>Content-Type: text/html; charset=3D"us-ascii"
>Content-Transfer-Encoding: quoted-printable
>
><html xmlns:v=3D3D"urn:schemas-microsoft-com:vml" =3D
>xmlns:o=3D3D"urn:schemas-microsoft-com:office:office" =3D
>xmlns:w=3D3D"urn:schemas-microsoft-com:office:word" =3D
>xmlns:x=3D3D"urn:schemas-microsoft-com:office:excel" =3D
>xmlns:p=3D3D"urn:schemas-microsoft-com:office:powerpoint" =3D
>xmlns:a=3D3D"urn:schemas-microsoft-com:office:access" =3D
>xmlns:dt=3D3D"uuid:C2F41010-65B3-11d1-A29F-00AA00C14882" =3D
>xmlns:s=3D3D"uuid:BDC6E3F0-6DA3-11d1-A2A3-00AA00C14882" =3D
>xmlns:rs=3D3D"urn:schemas-microsoft-com:rowset" =
xmlns:z=3D3D"#RowsetSchema" =3D
>xmlns:b=3D3D"urn:schemas-microsoft-com:office:publisher" =3D
>xmlns:ss=3D3D"urn:schemas-microsoft-com:office:spreadsheet" =3D
>xmlns:c=3D3D"urn:schemas-microsoft-com:office:component:spreadsheet" =
=3D
>xmlns:odc=3D3D"urn:schemas-microsoft-com:office:odc" =3D
>xmlns:oa=3D3D"urn:schemas-microsoft-com:office:activation" =3D
>xmlns:html=3D3D"http://www.w3.org/TR/REC-html40" =3D
>xmlns:q=3D3D"http://schemas.xmlsoap.org/soap/envelope/" =
xmlns:D=3D3D"DAV:" =3D
>xmlns:x2=3D3D"http://schemas.microsoft.com/office/excel/2003/xml" =3D
>xmlns:ois=3D3D"http://schemas.microsoft.com/sharepoint/soap/ois/" =3D
>xmlns:dir=3D3D"http://schemas.microsoft.com/sharepoint/soap/directory/" =
=3D
>xmlns:ds=3D3D"http://www.w3.org/2000/09/xmldsig#" =3D
>xmlns:dsp=3D3D"http://schemas.microsoft.com/sharepoint/dsp" =3D
>xmlns:udc=3D3D"http://schemas.microsoft.com/data/udc" =3D
>xmlns:xsd=3D3D"http://www.w3.org/2001/XMLSchema" =3D
>xmlns:sub=3D3D"http://schemas.microsoft.com/sharepoint/soap/2002/1/alert=
s/"=3D
> xmlns:ec=3D3D"http://www.w3.org/2001/04/xmlenc#" =3D
>xmlns:sp=3D3D"http://schemas.microsoft.com/sharepoint/" =3D
>xmlns:sps=3D3D"http://schemas.microsoft.com/sharepoint/soap/" =3D
>xmlns:xsi=3D3D"http://www.w3.org/2001/XMLSchema-instance" =3D
>xmlns:udcxf=3D3D"http://schemas.microsoft.com/data/udc/xmlfile" =3D
>xmlns:wf=3D3D"http://schemas.microsoft.com/sharepoint/soap/workflow/" =
=3D
>xmlns:mver=3D3D"http://schemas.openxmlformats.org/markup-compatibility/2=
006=3D
>" xmlns:m=3D3D"http://schemas.microsoft.com/office/2004/12/omml" =3D
>xmlns:mrels=3D3D"http://schemas.openxmlformats.org/package/2006/relation=
shi=3D
>ps" =3D
>xmlns:ex12t=3D3D"http://schemas.microsoft.com/exchange/services/2006/typ=
es"=3D
> =3D
>xmlns:ex12m=3D3D"http://schemas.microsoft.com/exchange/services/2006/mes=
sag=3D
>es" xmlns:Z=3D3D"urn:schemas-microsoft-com:" xmlns:st=3D3D"=01" =3D
>xmlns=3D3D"http://www.w3.org/TR/REC-html40">
>
><head>
><meta http-equiv=3D3DContent-Type content=3D3D"text/html; =3D
>charset=3D3Dus-ascii">
><meta name=3D3DGenerator content=3D3D"Microsoft Word 12 (filtered =
medium)">
><style>
><!--
> /* Font Definitions */
> @font-face
> {font-family:Calibri;
> panose-1:2 15 5 2 2 2 4 3 2 4;}
>@font-face
> {font-family:Tahoma;
> panose-1:2 11 6 4 3 5 4 4 2 4;}
>@font-face
> {font-family:Consolas;
> panose-1:2 11 6 9 2 2 4 3 2 4;}
> /* Style Definitions */
> p.MsoNormal, li.MsoNormal, div.MsoNormal
> {margin:0cm;
> margin-bottom:.0001pt;
> font-size:12.0pt;
> font-family:"Times New Roman","serif";
> color:black;}
>a:link, span.MsoHyperlink
> {mso-style-priority:99;
> color:blue;
> text-decoration:underline;}
>a:visited, span.MsoHyperlinkFollowed
> {mso-style-priority:99;
> color:purple;
> text-decoration:underline;}
>pre
> {mso-style-priority:99;
> mso-style-link:"HTML Preformatted Char";
> margin:0cm;
> margin-bottom:.0001pt;
> font-size:10.0pt;
> font-family:"Courier New";
> color:black;}
>span.HTMLPreformattedChar
> {mso-style-name:"HTML Preformatted Char";
> mso-style-priority:99;
> mso-style-link:"HTML Preformatted";
> font-family:Consolas;
> color:black;}
>span.EmailStyle19
> {mso-style-type:personal-reply;
> font-family:"Calibri","sans-serif";
> color:#1F497D;}
>.MsoChpDefault
> {mso-style-type:export-only;
> font-size:10.0pt;}
>@page Section1
> {size:612.0pt 792.0pt;
> margin:72.0pt 72.0pt 72.0pt 72.0pt;}
>div.Section1
> {page:Section1;}
>-->
></style>
><!--[if gte mso 9]><xml>
> <o:shapedefaults v:ext=3D3D"edit" spidmax=3D3D"1026" />
></xml><![endif]--><!--[if gte mso 9]><xml>
> <o:shapelayout v:ext=3D3D"edit">
> <o:idmap v:ext=3D3D"edit" data=3D3D"1" />
> </o:shapelayout></xml><![endif]-->
></head>
>
><body bgcolor=3D3Dwhite lang=3D3DEN-GB link=3D3Dblue vlink=3D3Dpurple>
>
><div class=3D3DSection1>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>Hi Noemi,<o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>the symptoms you describe are identical to a case I was =
=3D
>dealing
>with just a couple of weeks ago. It seems that these mysterious FLUKA =
=3D
>crashes
>occur on AMD chips with K8/K10 core architectures running under a 32 =
bit =3D
>linux.
>This is independent of the distribution as I tried several different =
=3D
>ones, with
>different math libs etc. Currently, the reason is not yet fully clear =
so =3D
>I
>would suggest to follow the steps that Paola described in her mail from
>yesterday as it would be helpful to pinpoint the problem to a specific
>scenario.<o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>In the meantime I can only offer a remedy which solved =
=3D
>this
>problem in our case. If you have the possibility you should try to =3D
>switch to a
>64 bit version of linux. I ran your input on such a machine we =3D
>previously had
>problems with and which is now running the 64 bit version of linux. =
Your =3D
>input finished
>gracefully following exactly the same random number sequence like on =
=3D
>your
>machine, just as expected.<o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>All 32 bit programs will be running under the 64 bit OS =
=3D
>as well
>but you might need to install a couple of additional libraries for =
FLUKA =3D
>to
>work. Some hints can be found in the following presentation of a =3D
>colleague which
>could be helpful:<o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><a
>href=3D3D"http://info-fluka-discussion.web.cern.ch/info-fluka-discussion=
/ta=3D
>lks/Ludovic_From32To64_190407.ppt">http://info-fluka-discussion.web.cern=
.=3D
>ch/info-fluka-discussion/talks/Ludovic_From32To64_190407.ppt</a><o:p></o=
:=3D
>p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>I saw that you're sending your jobs to a LSF batch =3D
>system and
>probably cannot change the OS there. In that case you might try to =3D
>investigate
>the possibilities to exclude certain architectures (I'd suggest =3D
>AMD
>Opteron and Phenom CPUs) from the scheduling process. =3D
><o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>Cheers<o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'>Chris<o:p></o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><p class=3D3DMsoNormal><span =3D
>style=3D3D'font-size:11.0pt;font-family:"Calibri","sans-serif";
>color:#1F497D'><o:p>&nbsp;</o:p></span></p>
>
><div style=3D3D'border:none;border-left:solid blue 1.5pt;padding:0cm =
0cm =3D
>0cm 4.0pt'>
>
><div>
>
><div style=3D3D'border:none;border-top:solid #B5C4DF =
1.0pt;padding:3.0pt =3D
>0cm 0cm 0cm'>
>
><p class=3D3DMsoNormal><b><span lang=3D3DEN-US =3D
>style=3D3D'font-size:10.0pt;font-family:
>"Tahoma","sans-serif";color:windowtext'>From:</span></b><span =3D
>lang=3D3DEN-US
>style=3D3D'font-size:10.0pt;font-family:"Tahoma","sans-serif";color:wind=
owt=3D
>ext'> Finetti
>Noemi [mailto:noemi.finetti_at_aquila.infn.it] <br>
><b>Sent:</b> 03 December 2008 10:34<br>
><b>To:</b> Chris Theis<br>
><b>Subject:</b> Re: FLUKA crash<o:p></o:p></span></p>
>
></div>
>
></div>
>
><p class=3D3DMsoNormal><o:p>&nbsp;</o:p></p>
>
><p class=3D3DMsoNormal>Hi Christian,<br>
>1) I am using a 32 bit version of SLC3;<br>
>2) when I run my FLUKA job the crash on the batch machines occurs at =
=3D
>differnt points&nbsp;
>(sometimes it occurs earlier and sometimes later).<br>
>3)&nbsp; -bash-3.1$ more /proc/cpuinfo<br>
>processor&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 0<br>
>vendor_id&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : AuthenticAMD<br>
>cpu family&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 15<br>
>model&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : =3D
>5<br>
>model name&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : AMD Opteron(tm) Processor =
=3D
>244<br>
>stepping&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 10<br>
>cpu MHz&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 1800.000<br>
>cache size&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 1024 KB<br>
>fdiv_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>hlt_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>f00f_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>coma_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>fpu&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nb=
s=3D
>p; :
>yes<br>
>fpu_exception&nbsp;&nbsp; : yes<br>
>cpuid level&nbsp;&nbsp;&nbsp;&nbsp; : 1<br>
>wp&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbs=
p=3D
>;&nbsp;
>: yes<br>
>flags&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : fpu =
=3D
>vme de
>pse tsc msr pae mce cx8 apic mtrr pge mca cmov pat<br>
>pse36 clflush mmx fxsr sse sse2 syscall nx mmxext lm 3dnowext 3dnow ts =
=3D
>fid vid
>ttp<br>
>bogomips&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 3617.79<br>
>&nbsp;<br>
>processor&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 1<br>
>vendor_id&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : AuthenticAMD<br>
>cpu family&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 15<br>
>model&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : =3D
>5<br>
>model name&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : AMD Opteron(tm) Processor =
=3D
>244<br>
>stepping&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 10<br>
>cpu MHz&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 1800.000<br>
>cache size&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 1024 KB<br>
>fdiv_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>hlt_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>f00f_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>coma_bug&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : no<br>
>fpu&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nb=
s=3D
>p; :
>yes<br>
>fpu_exception&nbsp;&nbsp; : yes<br>
>cpuid level&nbsp;&nbsp;&nbsp;&nbsp; : 1<br>
>wp&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbs=
p=3D
>;&nbsp;
>: yes<br>
>flags&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : fpu =
=3D
>vme de
>pse tsc msr pae mce cx8 apic mtrr pge mca cmov pat<br>
>pse36 clflush mmx fxsr sse sse2 syscall nx mmxext lm 3dnowext 3dnow ts =
=3D
>fid vid
>ttp<br>
>bogomips&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; : 3616.16<br>
><br>
>Thanks in advance,<br>
>noemi<br>
><br>
><br>
>Chris Theis wrote:<br>
><br>
><o:p></o:p></p>
>
><pre>Hi Noemi,<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>I'm =3D
>currently looking into your problem and I would need some =3D
>more<o:p></o:p></pre><pre>information. Could you please answer the =3D
>following questions which would<o:p></o:p></pre><pre>help me to check =
=3D
>whether this problem is similar to one that I have =3D
>seen<o:p></o:p></pre><pre>just =3D
>recently:<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>- Are you =
=3D
>using the 32 bit or the 64 bit version of =3D
>SLC3?<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>- When you run =
=3D
>your FLUKA job does the crash on the batch =3D
>machines<o:p></o:p></pre><pre>always occur at the =3D
>same<o:p></o:p></pre><pre>&nbsp; point or does it sometimes occur =3D
>earlier and sometimes =3D
>later?<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>- Could you =3D
>please run the following command on the execution PC =3D
>and<o:p></o:p></pre><pre>send me the output<o:p></o:p></pre><pre>&nbsp; =
=3D
>&quot;cat =3D
>/proc/cpuinfo&quot;<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>Cia=
o=3D
><o:p></o:p></pre><pre>Chris<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre>=
<=3D
>pre>--------------------------------------------------------------------=
-=3D
>---<o:p></o:p></pre><pre>Chris Theis<o:p></o:p></pre><pre>CERN/SC-RP - =
=3D
>European Organization for Nuclear Research<o:p></o:p></pre><pre>1211 =
=3D
>Geneva 23, Switzerland<o:p></o:p></pre><pre>Phone: +41 22 767 =3D
>8069&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&n=
b=3D
>sp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Office: =3D
>892-2A-015<o:p></o:p></pre><pre>e-mail: <a
>href=3D3D"mailto:Christian.Theis_at_cern.ch">Christian.Theis_at_cern.ch</a>&nb=
sp;=3D
>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; www: <a
>href=3D3D"http://www.cern.ch/theis">http://www.cern.ch/theis</a><o:p></o=
:p>=3D
></pre><pre>-------------------------------------------------------------=
-=3D
>----------<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre><o:p>&nbsp;<=
/=3D
>o:p></pre><pre>&nbsp; <o:p></o:p></pre>
>
><blockquote =3D
>style=3D3D'margin-top:5.0pt;margin-bottom:5.0pt'><pre>-----Original =3D
>Message-----<o:p></o:p></pre><pre>From: <a
>href=3D3D"mailto:owner-fluka-discuss_at_mi.infn.it">owner-fluka-discuss_at_mi.=
inf=3D
>n.it</a> [<a
>href=3D3D"mailto:owner-fluka">mailto:owner-fluka</a>-<o:p></o:p></pre><p=
re>=3D
><a
>href=3D3D"mailto:discuss_at_mi.infn.it">discuss_at_mi.infn.it</a>] On Behalf =
Of =3D
>Finetti Noemi<o:p></o:p></pre><pre>Sent: 27 November 2008 =3D
>15:31<o:p></o:p></pre><pre>To: <a
>href=3D3D"mailto:fluka-discuss_at_fluka.org">fluka-discuss_at_fluka.org</a><o:=
p><=3D
>/o:p></pre><pre>Subject: FLUKA =3D
>crash<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>Hi =3D
>all,<o:p></o:p></pre><pre>I have installed FLUKA 2008.3 on a linux =3D
>machine (Model: 2 AMD Opteron<o:p></o:p></pre><pre>(tm) Processor 244 - =
=3D
>1.8GHz; Operating system: Linux - =3D
>Scientific<o:p></o:p></pre><pre>&nbsp;&nbsp;&nbsp; =3D
><o:p></o:p></pre></blockquote>
>
><pre>Linux<o:p></o:p></pre><pre>&nbsp; <o:p></o:p></pre>
>
><blockquote =
style=3D3D'margin-top:5.0pt;margin-bottom:5.0pt'><pre>3.0.5; =3D
>with g77) where I have compiled my executable file =3D
>(myfluka)<o:p></o:p></pre><pre>which calls the user routines USRINI, =
=3D
>HISTIN,SOURCE, USRMED, USROUT<o:p></o:p></pre><pre>(see =3D
>user_routines.tar.gz). Executing the job in batch (for =3D
>399528<o:p></o:p></pre><pre>primaries) the run crashed while running =
the =3D
>same job interactively<o:p></o:p></pre><pre>(for =3D
>100<o:p></o:p></pre><pre>primaries) every thing was fine. What could be =
=3D
>the reasons?<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>I point =
=3D
>out that the batch job was executed on a 2 Dual Core =3D
>AMD<o:p></o:p></pre><pre>Opteron(tm) Processor 280 - 2.4 GHz with Linux =
=3D
>- Scientific =3D
>Linux<o:p></o:p></pre><pre>3.0.8.<o:p></o:p></pre><pre><o:p>&nbsp;</o:p>=
<=3D
>/pre><pre>In attachment the LSF message, the gdb output (file =3D
>fluka_gdb.out),<o:p></o:p></pre><pre>&nbsp;&nbsp;&nbsp; =3D
><o:p></o:p></pre></blockquote>
>
><pre>the<o:p></o:p></pre><pre>&nbsp; <o:p></o:p></pre>
>
><blockquote style=3D3D'margin-top:5.0pt;margin-bottom:5.0pt'><pre>files =
=3D
>.err, .log, .out, fort.1 and =3D
>fort.2.<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>Thanks in =3D
>advance,<o:p></o:p></pre><pre>noemi<o:p></o:p></pre><pre><o:p>&nbsp;</o:=
p=3D
> =20
>
>></pre><pre>--<o:p></o:p></pre><pre>------------------------------------=
-=3D
>> =20
>>
>--------------------------------<o:p></o:p></pre><pre>* Address: =3D
>Dott.ssa Noemi Finetti<o:p></o:p></pre><pre>&nbsp;&nbsp; c/o =3D
>Dipartimento di Fisica dell'Universita' degli Studi =3D
>dell'Aquila<o:p></o:p></pre><pre>&nbsp;&nbsp; Via Vetoio - 67010 =
Coppito =3D
>- L'Aquila - Italy<o:p></o:p></pre><pre>* Phone: +39-0862-433051 =3D
>(Office); +39-0862-433043 (Laboratory)<o:p></o:p></pre><pre>* Fax: =3D
>+39-0862-433033 =3D
>(Department).<o:p></o:p></pre><pre>-------------------------------------=
-=3D
>-------------------------------<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></=
p=3D
>re><pre><o:p>&nbsp;</o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>&nbsp;&n=
b=3D
>sp;&nbsp; <o:p></o:p></pre></blockquote>
>
><pre><o:p>&nbsp;</o:p></pre><pre><o:p>&nbsp;</o:p></pre><pre>&nbsp; =3D
><o:p></o:p></pre>
>
><p class=3D3DMsoNormal><br>
><br>
><o:p></o:p></p>
>
><pre>-- =3D
><o:p></o:p></pre><pre>--------------------------------------------------=
-=3D
>------------------<o:p></o:p></pre><pre>* Address: Dott.ssa Noemi =3D
>Finetti<o:p></o:p></pre><pre>&nbsp; c/o Dipartimento di Fisica =3D
>dell'Universita' degli Studi dell'Aquila<o:p></o:p></pre><pre>&nbsp; =
Via =3D
>Vetoio - 67010 Coppito - L'Aquila - Italy<o:p></o:p></pre><pre>* Phone: =
=3D
>+39-0862-433051 (Office); +39-0862-433043 =3D
>(Laboratory)<o:p></o:p></pre><pre>* Fax: +39-0862-433033 =3D
>(Department).<o:p></o:p></pre><pre>-------------------------------------=
-=3D
>-------------------------------<o:p></o:p></pre><pre><o:p>&nbsp;</o:p></=
p=3D
>re><pre><o:p>&nbsp;</o:p></pre></div>
>
></div>
>
></body>
>
></html>
>
>------_=3D_NextPart_001_01C95537.0C95CDE5--
>
>
> =20
>

--=20
---------------------------------------------------------------------
* Address: Dott.ssa Noemi Finetti
  c/o Dipartimento di Fisica dell'Universita' degli Studi dell'Aquila
  Via Vetoio - 67010 Coppito - L'Aquila - Italy
* Phone: +39-0862-433051 (Office); +39-0862-433043 (Laboratory)
* Fax: +39-0862-433033 (Department).
---------------------------------------------------------------------
Received on Thu Dec 04 2008 - 23:50:32 CET

This archive was generated by hypermail 2.2.0 : Thu Dec 04 2008 - 23:50:32 CET