# fix ave/chunk command ## Syntax ``` LAMMPS fix ID group-ID ave/chunk Nevery Nrepeat Nfreq chunkID value1 value2 ... keyword args ... ``` - ID, group-ID are documented in [fix](fix) command - ave/chunk = style name of this fix command - Nevery = use input values every this many timesteps - Nrepeat = \# of times to use input values for calculating averages - Nfreq = calculate averages every this many timesteps - chunkID = ID of [compute chunk/atom](compute_chunk_atom) command - one or more input values can be listed - value = *vx*, *vy*, *vz*, *fx*, *fy*, *fz*, *density/mass*, *density/number*, *mass*, *temp*, c_ID, c_ID\[I\], f_ID, f_ID\[I\], v_name vx,vy,vz,fx,fy,fz,mass = atom attribute (velocity, force component, mass) density/number, density/mass = number or mass density (per volume) temp = temperature c_ID = per-atom vector calculated by a compute with ID c_ID[I] = Ith column of per-atom array calculated by a compute with ID, I can include wildcard (see below) f_ID = per-atom vector calculated by a fix with ID f_ID[I] = Ith column of per-atom array calculated by a fix with ID, I can include wildcard (see below) v_name = per-atom vector calculated by an atom-style variable with name - zero or more keyword/arg pairs may be appended - keyword = *norm* or *ave* or *bias* or *adof* or *cdof* or *file* or *overwrite* or *format* or *title1* or *title2* or *title3* *norm* arg = *all* or *sample* or *none* = how output on *Nfreq* steps is normalized all = output is sum of atoms across all *Nrepeat* samples, divided by atom count sample = output is sum of *Nrepeat* sample averages, divided by *Nrepeat* none = output is sum of *Nrepeat* sample sums, divided by *Nrepeat* *ave* args = *one* or *running* or *window M* one = output new average value every Nfreq steps running = output cumulative average of all previous Nfreq steps window M = output average of M most recent Nfreq steps *bias* arg = bias-ID bias-ID = ID of a temperature compute that removes a velocity bias for temperature calculation *adof* value = dof_per_atom dof_per_atom = define this many degrees-of-freedom per atom for temperature calculation *cdof* value = dof_per_chunk dof_per_chunk = define this many degrees-of-freedom per chunk for temperature calculation *file* arg = filename filename = file to write results to *overwrite* arg = none = overwrite output file with only latest output *format* arg = string string = C-style format string *title1* arg = string string = text to print as 1st line of output file *title2* arg = string string = text to print as 2nd line of output file *title3* arg = string string = text to print as 3rd line of output file ## Examples ``` LAMMPS fix 1 all ave/chunk 10000 1 10000 binchunk c_myCentro title1 "My output values" fix 1 flow ave/chunk 100 10 1000 molchunk vx vz norm sample file vel.profile fix 1 flow ave/chunk 100 5 1000 binchunk density/mass ave running fix 1 flow ave/chunk 100 5 1000 binchunk density/mass ave running ``` ::::: note ::: title Note ::: ::: versionchanged 31May2016 ::: If you are trying to replace a deprecated fix ave/spatial command with the newer, more flexible fix ave/chunk and [compute chunk/atom](compute_chunk_atom) commands, you simply need to split the fix ave/spatial arguments across the two new commands. For example, this command: ``` LAMMPS fix 1 flow ave/spatial 100 10 1000 y 0.0 1.0 vx vz norm sample file vel.profile ``` could be replaced by: ``` LAMMPS compute cc1 flow chunk/atom bin/1d y 0.0 1.0 fix 1 flow ave/chunk 100 10 1000 cc1 vx vz norm sample file vel.profile ``` ::::: ## Description Use one or more per-atom vectors as inputs every few timesteps, sum the values over the atoms in each chunk at each timestep, then average the per-chunk values over longer timescales. The resulting chunk averages can be used by other [output commands](Howto_output) such as [thermo_style custom](thermo_style), and can also be written to a file. In LAMMPS, chunks are collections of atoms defined by a [compute chunk/atom](compute_chunk_atom) command, which assigns each atom to a single chunk (or no chunk). The ID for this command is specified as chunkID. For example, a single chunk could be the atoms in a molecule or atoms in a spatial bin. See the [compute chunk/atom](compute_chunk_atom) page and the [Howto chunk](Howto_chunk) page for details of how chunks can be defined and examples of how they can be used to measure properties of a system. Note that if the [compute chunk/atom](compute_chunk_atom) command defines spatial bins, the fix ave/chunk command performs a similar computation as the [fix ave/grid](fix_ave_grid) command. However, the per-bin outputs from the fix ave/chunk command are global; each processor stores a copy of the entire set of bin data. By contrast, the [fix ave/grid](fix_ave_grid) command uses a distributed grid where each processor owns a subset of the bins. Thus it is more efficient to use the [fix ave/grid](fix_ave_grid) command when the grid is large and a simulation is run on many processors. Note that only atoms in the specified group contribute to the summing and averaging calculations. The [compute chunk/atom](compute_chunk_atom) command defines its own group as well as an optional region. Atoms will have a chunk ID = 0, meaning they belong to no chunk, if they are not in that group or region. Thus you can specify the \"all\" group for this command if you simply want to use the chunk definitions provided by chunkID. Each specified per-atom value can be an atom attribute (position, velocity, force component), a number or mass density, a mass or temperature, or the result of a [compute](compute) or [fix](fix) or the evaluation of an atom-style [variable](variable). In the latter cases, the compute, fix, or variable must produce a per-atom quantity, not a global quantity. Note that the [compute property/atom](compute_property_atom) command provides access to any attribute defined and stored by atoms. If you wish to time-average global quantities from a compute, fix, or variable, then see the [fix ave/time](fix_ave_time) command. The per-atom values of each input vector are summed and averaged independently of the per-atom values in other input vectors. [Computes](compute) that produce per-atom quantities are those which have the word *atom* in their style name. See the doc pages for individual [fixes](fix) to determine which ones produce per-atom quantities. [Variables](variable) of style *atom* are the only ones that can be used with this fix since all other styles of variable produce global quantities. Note that for values from a compute or fix that produces a per-atom array (multiple values per atom), the bracketed index I can be specified using a wildcard asterisk with the index to effectively specify multiple values. This takes the form \"\*\" or \"\*n\" or \"n\*\" or \"m\*n\". If $N$ = the size of the vector (for *mode* = scalar) or the number of columns in the array (for *mode* = vector), then an asterisk with no numeric values means all indices from 1 to $N$. A leading asterisk means all indices from 1 to n (inclusive). A trailing asterisk means all indices from m to $N$ (inclusive). A middle asterisk means all indices from m to n (inclusive). Using a wildcard is the same as if the individual columns of the array had been listed one by one. For example, these two fix ave/chunk commands are equivalent, since the [compute property/atom](compute_property_atom) command creates, in this case, a per-atom array with three columns: ``` LAMMPS compute myAng all property/atom angmomx angmomy angmomz fix 1 all ave/chunk 100 1 100 cc1 c_myAng[*] file tmp.angmom fix 2 all ave/chunk 100 1 100 cc1 c_myAng[1] c_myAng[2] c_myAng[3] file tmp.angmom ``` :::: note ::: title Note ::: This fix works by creating an array of size $N_\text{chunk} \times N_\text{values}$ on each processor. $N_\text{chunk}$ is the number of chunks, which is defined by the [compute chunk/atom](compute_chunk_atom) command. $N_\text{values}$ is the number of input values specified. Each processor loops over its atoms, tallying its values to the appropriate chunk. Then the entire array is summed across all processors. This means that using a large number of chunks will incur an overhead in memory and computational cost (summing across processors), so be careful to define a reasonable number of chunks. :::: ------------------------------------------------------------------------ The $N_\text{every}$, $N_\text{repeat}$, and $N_\text{freq}$ arguments specify on what time steps the input values will be accessed and contribute to the average. The final averaged quantities are generated on time steps that are a multiples of $N_\text{freq}$. The average is over $N_\text{repeat}$ quantities, computed in the preceding portion of the simulation every $N_\text{every}$ time steps. $N_\text{freq}$ must be a multiple of $N_\text{every}$ and $N_\text{every}$ must be non-zero even if $N_\text{repeat} = 1$. Also, the time steps contributing to the average value cannot overlap (i.e., $N_\text{repeat}N_\text{every}$ cannot exceed $N_\text{freq}$). For example, if $N_\text{every}=2$, $N_\text{repeat}=6$, and $N_\text{freq}=100$, then values on time steps 90, 92, 94, 96, 98, 100 will be used to compute the final average on time step 100. Similarly for time steps 190, 192, 194, 196, 198, 200 on time step 200, etc. If $N_\text{repeat}=1$ and $N_\text{freq} = 100$, then no time averaging is done; values are simply generated on time steps 100, 200, etc. Each input value can also be averaged over the atoms in each chunk. The way the averaging is done across the $N_\text{repeat}$ time steps to produce output on the $N_\text{freq}$ time steps, and across multiple $N_\text{freq}$ outputs, is determined by the *norm* and *ave* keyword settings, as discussed below. :::: note ::: title Note ::: To perform per-chunk averaging within a $N_\text{freq}$ time window, the number of chunks $N_\text{chunk}$ defined by the [compute chunk/atom](compute_chunk_atom) command must remain constant. If the *ave* keyword is set to *running* or *window* then $N_\text{chunk}$ must remain constant for the duration of the simulation. This fix forces the chunk/atom compute specified by chunkID to hold $N_\text{chunk}$ constant for the appropriate time windows, by not allowing it to re-calculate $N_\text{chunk}$, which can also affect how it assigns chunk IDs to atoms. This is particularly important to understand if the chunks defined by the [compute chunk/atom](compute_chunk_atom) command are spatial bins. If its *units* keyword is set to *box* or *lattice*, then the number of bins $N_\text{chunk}$ and size of each bin will be fixed over the $N_\text{freq}$ time window, which can affect which atoms are discarded if the simulation box size changes. If its *units* keyword is set to *reduced*, then the number of bins $N_\text{chunk}$ will still be fixed, but the size of each bin can vary at each time step if the simulation box size changes (e.g., for an NPT simulation). :::: ------------------------------------------------------------------------ The atom attribute values (*vx*, *vy*, *vz*, *fx*, *fy*, *fz*, *mass*) are self-explanatory. As noted above, any other atom attributes can be used as input values to this fix by using the [compute property/atom](compute_property_atom) command and then specifying an input value from that compute. The *density/number* value means the number density is computed for each chunk (i.e., number/volume). The *density/mass* value means the mass density is computed for each chunk (i.e., total-mass/volume). The output values are in units of 1/volume or mass density (mass/volume). See the [units](units) command page for the definition of density for each choice of units (e.g., g/cm$^3$). If the chunks defined by the [compute chunk/atom](compute_chunk_atom) command are spatial bins, the volume is the bin volume. Otherwise, it is the volume of the entire simulation box. The *temp* value means the temperature is computed for each chunk, by the formula $$\text{KE} = \frac{\text{DOF}}{2} k_B T,$$ where KE is the total kinetic energy of the chunk of atoms (sum of $\frac{1}{2} m v^2$), DOF is the the total number of degrees of freedom for all atoms in the chunk, $k_B$ is the Boltzmann constant, and $T$ is the absolute temperature. The DOF is calculated as $N$\*adof + cdof, where $N$ is the number of atoms in the chunk, adof is the number of degrees of freedom per atom, and cdof is the number of degrees of freedom per chunk. By default, adof = 2 or 3 = dimensionality of system, as set via the [dimension](dimension) command, and cdof = 0.0. This gives the usual formula for temperature. Note that currently this temperature only includes translational degrees of freedom for each atom. No rotational degrees of freedom are included for finite-size particles. Also, no degrees of freedom are subtracted for any velocity bias or constraints that are applied, such as [compute temp/partial](compute_temp_partial), [fix shake](fix_shake), or [fix rigid](fix_rigid). This is because those degrees of freedom (e.g., a constrained bond) could apply to sets of atoms that are both included and excluded from a specific chunk, and hence the concept is somewhat ill-defined. In some cases, you can use the *adof* and *cdof* keywords to adjust the calculated degrees of freedom appropriately, as explained below. Also note that a bias can be subtracted from atom velocities before they are used in the above formula for KE, by using the *bias* keyword. This allows, for example, a thermal temperature to be computed after removal of a flow velocity profile. Note that the per-chunk temperature calculated by this fix and the [compute temp/chunk](compute_temp_chunk) command can be different. The compute calculates the temperature for each chunk for a single snapshot. This fix can do that but can also time average those values over many snapshots, or it can compute a temperature as if the atoms in the chunk on different time steps were collected together as one set of atoms to calculate their temperature. The compute allows the center-of-mass velocity of each chunk to be subtracted before calculating the temperature; this fix does not. If a value begins with \"c\_\", a compute ID must follow which has been previously defined in the input script. If no bracketed integer is appended, the per-atom vector calculated by the compute is used. If a bracketed integer is appended, the Ith column of the per-atom array calculated by the compute is used. Users can also write code for their own compute styles and [add them to LAMMPS](Modify). See the discussion above for how I can be specified with a wildcard asterisk to effectively specify multiple values. If a value begins with \"f\_\", a fix ID must follow which has been previously defined in the input script. If no bracketed integer is appended, the per-atom vector calculated by the fix is used. If a bracketed integer is appended, the Ith column of the per-atom array calculated by the fix is used. Note that some fixes only produce their values on certain time steps, which must be compatible with $N_\text{every}$, else an error results. Users can also write code for their own fix styles and [add them to LAMMPS](Modify). See the discussion above for how I can be specified with a wildcard asterisk to effectively specify multiple values. If a value begins with \"v\_\", a variable name must follow which has been previously defined in the input script. Variables of style *atom* can reference thermodynamic keywords and various per-atom attributes, or invoke other computes, fixes, or variables when they are evaluated, so this is a very general means of generating per-atom quantities to average within chunks. ------------------------------------------------------------------------ Additional optional keywords also affect the operation of this fix and its outputs. The *norm* keyword affects how averaging is done for the per-chunk values that are output every $N_\text{freq}$ time steps. It the *norm* setting is *all*, which is the default, a chunk value is summed over all atoms in all $N_\text{repeat}$ samples, as is the count of atoms in the chunk. The averaged output value for the chunk on the $N_\text{freq}$ time steps is Total-sum / Total-count. In other words it is an average over atoms across the entire $N_\text{freq}$ timescale. For the *density/number* and *density/mass* values, the volume (bin volume or system volume) used in the final normalization will be the volume at the final $N_\text{freq}$ time step. For the *temp* values, degrees of freedom and kinetic energy are summed separately across the entire $N_\text{freq}$ timescale, and the output value is calculated by dividing those two sums. If the *norm* setting is *sample*, the chunk value is summed over atoms for each sample, as is the count, and an \"average sample value\" is computed for each sample (i.e., Sample-sum / Sample-count). The output value for the chunk on the $N_\text{freq}$ time steps is the average of the $N_\text{repeat}$ \"average sample values\" (i.e., the sum of $N_\text{repeat}$ \"average sample values\" divided by $N_\text{repeat}$). In other words, it is an average of an average. For the *density/number* and *density/mass* values, the volume (bin volume or system volume) used in the per-sample normalization will be the current volume at each sampling step. If the *norm* setting is *none*, a similar computation as for the *sample* setting is done, except the individual \"average sample values\" are \"summed sample values\". A summed sample value is simply the chunk value summed over atoms in the sample, without dividing by the number of atoms in the sample. The output value for the chunk on the $N_\text{freq}$ timesteps is the average of the $N_\text{repeat}$ \"summed sample values\" (i.e., the sum of $N_\text{repeat}$ \"summed sample values\" divided by $N_\text{repeat}$). For the *density/number* and *density/mass* values, the volume (bin volume or system volume) used in the per-sample sum normalization will be the current volume at each sampling step. ------------------------------------------------------------------------ The *ave* keyword determines how the per-chunk values produced every $N_\text{freq}$ steps are averaged with values produced on previous steps that were multiples of $N_\text{freq}$, before they are accessed by another output command or written to a file. If the *ave* setting is *one*, which is the default, then the chunk values produced on timesteps that are multiples of $N_\text{freq}$ are independent of each other; they are output as-is without further averaging. If the *ave* setting is *running*, then the chunk values produced on timesteps that are multiples of $N_\text{freq}$ are summed and averaged in a cumulative sense before being output. Each output chunk value is thus the average of the chunk value produced on that timestep with all preceding values for the same chunk. This running average begins when the fix is defined; it can only be restarted by deleting the fix via the [unfix](unfix) command, or re-defining the fix by re-specifying it. If the *ave* setting is *window*, then the chunk values produced on timesteps that are multiples of $N_\text{freq}$ are summed and averaged within a moving \"window\" of time, so that the last $M$ values for the same chunk are used to produce the output. For example, if $M = 3$ and $N_\text{freq} = 1000$, then the output on step 10000 will be the average of the individual chunk values on time steps 8000, 9000, and 10000. Outputs on early steps will average over less than $M$ values if they are not available. ------------------------------------------------------------------------ The *bias* keyword specifies the ID of a temperature compute that removes a \"bias\" velocity from each atom, specified as *bias-ID*. It is only used when the *temp* value is calculated, to compute the thermal temperature of each chunk after the translational kinetic energy components have been altered in a prescribed way (e.g., to remove a flow velocity profile). See the doc pages for individual computes that calculate a temperature to see which ones implement a bias. The *adof* and *cdof* keywords define the values used in the degree of freedom (DOF) formula described above for temperature calculation for each chunk. They are only used when the *temp* value is calculated. They can be used to calculate a more appropriate temperature for some kinds of chunks. Here are three examples: If spatially binned chunks contain some number of water molecules and [fix shake](fix_shake) is used to make each molecule rigid, then you could calculate a temperature with six degrees of freedom (DOF) (three translational, three rotational) per molecule by setting *adof* to 2.0. If [compute temp/partial](compute_temp_partial) is used with the *bias* keyword to only allow the $x$ component of velocity to contribute to the temperature, then *adof* = 1.0 would be appropriate. If each chunk consists of a large molecule, with some number of its bonds constrained by [fix shake](fix_shake) or the entire molecule by [fix rigid/small](fix_rigid), *adof* = 0.0 and *cdof* could be set to the remaining degrees of freedom for the entire molecule (entire chunk in this case), that is, 6 for 3d or 3 for 2d for a rigid molecule. ------------------------------------------------------------------------ The *file* keyword allows a filename to be specified. Every $N_\text{freq}$ timesteps, a section of chunk info will be written to a text file in the following format. A line with the timestep and number of chunks is written. Then one line per chunk is written, containing the chunk ID $(1-N_\text{chunk}),$ an optional original ID value, optional coordinate values for chunks that represent spatial bins, the number of atoms in the chunk, and one or more calculated values. More explanation of the optional values is given below. The number of values in each line corresponds to the number of values specified in the fix ave/chunk command. The number of atoms and the value(s) are summed or average quantities, as explained above. The *overwrite* keyword will continuously overwrite the output file with the latest output, so that it only contains one timestep worth of output. This option can only be used with the *ave running* setting. The *format* keyword sets the numeric format of each value when it is printed to a file via the *file* keyword. Note that all values are floating point quantities. The default format is %g. You can specify a higher precision if desired (e.g., %20.16g). The *title1* and *title2* and *title3* keywords allow specification of the strings that will be printed as the first three lines of the output file, assuming the *file* keyword was used. LAMMPS uses default values for each of these, so they do not need to be specified. By default, these header lines are as follows: # Chunk-averaged data for fix ID and group name # Timestep Number-of-chunks # Chunk (OrigID) (Coord1) (Coord2) (Coord3) Ncount value1 value2 ... In the first line, ID and name are replaced with the fix-ID and group name. The second line describes the two values that are printed at the first of each section of output. In the third line the values are replaced with the appropriate value names (e.g., *fx* or c_myCompute\[2\]). The words in parenthesis only appear with corresponding columns if the chunk style specified for the [compute chunk/atom](compute_chunk_atom) command supports them. The OrigID column is only used if the *compress* keyword was set to *yes* for the [compute chunk/atom](compute_chunk_atom) command. This means that the original chunk IDs (e.g., molecule IDs) will have been compressed to remove chunk IDs with no atoms assigned to them. Thus a compressed chunk ID of 3 may correspond to an original chunk ID or molecule ID of 415. The OrigID column will list 415 for the third chunk. The CoordN columns only appear if a *binning* style was used in the [compute chunk/atom](compute_chunk_atom) command. For *bin/1d*, *bin/2d*, and *bin/3d* styles the column values are the center point of the bin in the corresponding dimension. Just Coord1 is used for *bin/1d*, Coord2 is added for *bin/2d*, Coord3 is added for *bin/3d*. For *bin/sphere*, just Coord1 is used, and it is the radial coordinate. For *bin/cylinder*, Coord1 and Coord2 are used. Coord1 is the radial coordinate (away from the cylinder axis), and coord2 is the coordinate along the cylinder axis. Note that if the value of the *units* keyword used in the [compute chunk/atom command](compute_chunk_atom) is *box* or *lattice*, the coordinate values will be in distance [units](units). If the value of the *units* keyword is *reduced*, the coordinate values will be in unitless reduced units (0\--1). This is not true for the Coord1 value of style *bin/sphere* or *bin/cylinder* which both represent radial dimensions. Those values are always in distance [units](units). ------------------------------------------------------------------------ ## Restart, fix_modify, output, run start/stop, minimize info No information about this fix is written to [binary restart files](restart). None of the [fix_modify](fix_modify) options are relevant to this fix. This fix computes a global array of values which can be accessed by various [output commands](Howto_output). The values can only be accessed on timesteps that are multiples of $N_\text{freq}$, since that is when averaging is performed. The global array has \# of rows = the number of chunks $N_\text{chunk}$, as calculated by the specified [compute chunk/atom](compute_chunk_atom) command. The \# of columns is $M+1+N_\text{values}$, where $M \in \{1,\dotsc,4\}$, depending on whether the optional columns for OrigID and CoordN are used, as explained above. Following the optional columns, the next column contains the count of atoms in the chunk, and the remaining columns are the Nvalue quantities. When the array is accessed with a row $I$ that exceeds the current number of chunks, than a 0.0 is returned by the fix instead of an error, since the number of chunks can vary as a simulation runs depending on how that value is computed by the compute chunk/atom command. The array values calculated by this fix are treated as \"intensive\", since they are typically already normalized by the count of atoms in each chunk. No parameter of this fix can be used with the *start/stop* keywords of the [run](run) command. This fix is not invoked during [energy minimization](minimize). ## Restrictions > none ## Related commands [compute](compute), [fix ave/atom](fix_ave_atom), [fix ave/histo](fix_ave_histo), [fix ave/time](fix_ave_time), [variable](variable), [fix ave/correlate](fix_ave_correlate), [fix ave/atogrid](fix_ave_grid) ## Default The option defaults are norm = all, ave = one, bias = none, no file output, and title 1,2,3 = strings as described above.