pymatgen.apps.borg.hive module

class AbstractDrone[source]

Bases: monty.json.MSONable

Abstract drone class that defines the various methods that must be implemented by drones. Because of the quirky nature of Python”s multiprocessing, the intermediate data representations has to be in the form of python primitives. So all objects that drones work with must be MSONable. All drones must also implement the standard MSONable as_dict() and from_dict API.

assimilate(path)[source]

Assimilate data in a directory path into a pymatgen object. Because of the quirky nature of Python”s multiprocessing, the object must support pymatgen”s as_dict() for parallel processing.

Parameters:path – directory path
Returns:An assimilated object
get_valid_paths(path)[source]

Checks if path contains valid data for assimilation, and then returns the valid paths. The paths returned can be a list of directory or file paths, depending on what kind of data you are assimilating. For example, if you are assimilating VASP runs, you are only interested in directories containing vasprun.xml files. On the other hand, if you are interested converting all POSCARs in a directory tree to cifs for example, you will want the file paths.

Parameters:path – input path as a tuple generated from os.walk, i.e., (parent, subdirs, files).
Returns:List of valid dir/file paths for assimilation
class GaussianToComputedEntryDrone(inc_structure=False, parameters=None, data=None, file_extensions=('.log', ))[source]

Bases: pymatgen.apps.borg.hive.AbstractDrone

GaussianToEntryDrone assimilates directories containing Gaussian output to ComputedEntry/ComputedStructureEntry objects. By default, it is assumed that Gaussian output files have a ”.log” extension.

Parameters:
  • inc_structure (bool) – Set to True if you want ComputedStructureEntries to be returned instead of ComputedEntries.
  • parameters (list) – Input parameters to include. It has to be one of the properties supported by the GaussianOutput object. See pymatgen.io.gaussianio GaussianOutput. The parameters have to be one of python”s primitive types, i.e., list, dict of strings and integers. If parameters is None, a default set of parameters will be set.
  • data (list) – Output data to include. Has to be one of the properties supported by the GaussianOutput object. The parameters have to be one of python”s primitive types, i.e. list, dict of strings and integers. If data is None, a default set will be set.
  • file_extensions (list) – File extensions to be considered as Gaussian output files. Defaults to just the typical “log” extension.

Note

Like the GaussianOutput class, this is still in early beta.

as_dict()[source]
assimilate(path)[source]
classmethod from_dict(d)[source]
get_valid_paths(path)[source]
class SimpleVaspToComputedEntryDrone(inc_structure=False)[source]

Bases: pymatgen.apps.borg.hive.VaspToComputedEntryDrone

A simpler VaspToComputedEntryDrone. Instead of parsing vasprun.xml, it parses only the INCAR, POTCAR, OSZICAR and KPOINTS files, which are much smaller and faster to parse. However, much fewer properties are available compared to the standard VaspToComputedEntryDrone.

Parameters:inc_structure (bool) – Set to True if you want ComputedStructureEntries to be returned instead of ComputedEntries. Structure will be parsed from the CONTCAR.
as_dict()[source]
assimilate(path)[source]
classmethod from_dict(d)[source]
class VaspToComputedEntryDrone(inc_structure=False, parameters=None, data=None)[source]

Bases: pymatgen.apps.borg.hive.AbstractDrone

VaspToEntryDrone assimilates directories containing vasp output to ComputedEntry/ComputedStructureEntry objects. There are some restrictions on the valid directory structures:

  1. There can be only one vasp run in each directory.
  2. Directories designated “relax1”, “relax2” are considered to be 2 parts of an aflow style run, and only “relax2” is parsed.
  3. The drone parses only the vasprun.xml file.
Parameters:
  • inc_structure (bool) – Set to True if you want ComputedStructureEntries to be returned instead of ComputedEntries.
  • parameters (list) – Input parameters to include. It has to be one of the properties supported by the Vasprun object. See pymatgen.io.vasp.Vasprun. If parameters is None, a default set of parameters that are necessary for typical post-processing will be set.
  • data (list) – Output data to include. Has to be one of the properties supported by the Vasprun object.
as_dict()[source]
assimilate(path)[source]
classmethod from_dict(d)[source]
get_valid_paths(path)[source]